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LXXXIX. The Statistics of Correlated Events.—I. 


By C. Doms, 
Clarendon Laboratory, Oxford *. 


[Received, June 20, 1950). 


SUMMARY. 


The present paper is an attempt to formulate the mathematical 
properties of events occurring in time with a correlation between them. 
One effect of the correlation is a change in the mean square deviation of 
the number of events occurring in an interval of time ¢. By analogy 
with the shot effect in the presence of space change, this is denoted by a 
factor I for large ¢t, and I is evaluated in terms of the interval 
distributions between events. For finite intervals ¢ a corresponding 
factor y*(t) is introduced, which usually tends to 1 as t+0. A correlation 
function between events occurring at times separated by an interval + 
is defined, and its relation to y(t) is discussed. A generalization of 
Campbell’s Theorem applying to correlated events is derived. Finally 
the problem of the random partitioning of correlated events is discussed. 


§1. INTRODUCTION. 


Suppose that events occur randomly and independently in time, the 
probability of an event occurring in the interval (t, t+dt) being Adt; 
such events will be referred to briefly as ‘‘ random events’. There are 
a number of problems in which the concept of random events can be 
regarded as a valid physical approximation, and as a typical example 
we may take the emission of electrons from a cathode in a valve. The 
properties of random events have been studied extensively, particularly 
in connection with the shot effect and the properties of random noise. 
Little attention, however, seems to have been paid to the properties of 
‘correlated events’, that is, events in which the probability of an 
occurrence at a certain time is dependent on the distribution of events 
which have occurred previously. 

The most common physical problem in which there is a correlation 
between events is the shot effect in the presence of space charge, and 
here the correlating mechanism is enormously complicated. However, 
the essential physical results of the correlation can be expressed quite 
simply. For random events it can readily be shown (e. g. Bateman 1910) 
that, if m is the number of events occurring in a finite interval of length ¢, 


BGG aN a tetas vss, Maye cs SdGL) 
He eee) ee oe. Es te (2) 
* Communicated by Prof. M. H. L. Pryce. 
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where E(w) denotes the expectation, or mean value of w. In the presence 
of space charge there are two changes in the above formule. Firstly, 
the mean number of events is reduced, so that we now have 

Hint go ee SO Se 
where « lies between 0 and 1. Secondly, the fluctuations are smoothed, 
so that a4 

E(n)—[B(n)P oP taht, 2 2. 2 
where I” lies between 0 and 1, and is usually referred to as the space 
charge reduction factor. It should be noted that although (2) is valid 
for an interval of any length, ¢, the definition of J in (4) usually refers 
to the asymptotic value for large t; this point will be considered in more 
detail later in §5. 

A good deal of work has been done on the evaluation of J” in particular 
cases of physical interest (see North (1940) for references). In point of 
fact, however, the relations (3) and (4) arise immediately a correlation 
is introduced between events, and a fundamental analysis in terms of 
the theory of probability should be possible, giving « and J? in terms of 
the correlations between events. As far as the writer is aware, the only 
attempt along these lines is that of Whittaker (1938). 

In the present paper it is proposed to consider classes of correlated 
events in which the probability of an event occurring at a given time 
depends on the time of occurrence of the previous 7 events. This means 
that any r intervals between successive occurrences are correlated, and 
such events will be referred to as events of class ry. For many types of 
non-random events this. is not a particularly simple definition. For 
example, if we consider a purely random primary series in which each 
primary event is followed by a secondary event at a fixed time 7 after it, 
one can easily see that the complete set of primary and secondary 
events are of class oo, even though the properties of the set are easy to 
discuss by elementary means. Nevertheless, given any set of events, 
the property immediately accessible to observation is the length of 
successive intervals, and one of our purposes is to relate the interval 
distributions to « and J” defined in (3) and (4). 


§2. Events or Cuass l. 

We first consider in detail the properties of events in which the 
probability of an occurrence depends only on the time which has elapsed 
from the previous occurrence. This type of problem arises in the theory 
of particle counters, and has been discussed by a number of authors 
(see e.g. Feller 1948); it has also arisen recently in connection with 
genetics (Owen 1949). The advantageous feature of events of class 1 
is that an exact solution is available in terms of Laplace transforms 
(Domb 1948, Owen 1949), and the physical properties of the solution 
can readily be determined. We now proceed to deduce this solution - 
much of the development will parallel previous discussions, but the 
results are required in a somewhat different form, By 
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Suppose that if no event has occurred in an interval é after a given 
event, the probability of an occurrence in the interval (€, é+dé&) is 
Af(é)dé. In cases of practical interest, we should expect f(€) to tend to 
unity for large €. Let 4(€) be the probability that no event occurs in 
an interval € after a given event. Then 


$(E+dE)=$(€)[1 —Af(E) dE] 
or Hee (EE)= f Fluydu). od Perro tees (79) 


The probability that the interval between two successive events has 
length between € and €+dé is thus 


Mf(Ee-MOGE—GE)de. 2 ees (6) 


Now let u,(t) be the probability that » events occur in the interval 
(0, t). We can divide w,(t) into mutually exclusive classes in which the 
last event occurs in (t—n, t—n+dn), the probability being w,,(t, n)dy. 
It is easy to see that 


Ut, 7=e Mu, (t—7, 0) =e "Ow, (i—n), . . - (7) 
; t t 

so that w,(t)= | U,(t, n)dq= | ae ue A uo 18) 
0 0 


We may rewrite (8) symbolically in the form 
u,(t)= Pw,,(t), ve ee? are ee ret) 


where P is the linear operator defined by (8). 

w,(t) can similarly be divided into mutually exclusive classes in which 
the last event but one occurs in (t—, t—¢+-dZ), and as above, we deduce 
that 


w,(t)= [en alt—Oat6) ae, BGP tS Bees ae er 10) 


or symbolically Ue y= Qin a(t) Qs ayh), . . - ~ «2 (11) 


where Q is the linear operator defined by (10). “8, 

In discussing the probability distribution of numbers of events it is 
usually convenient to introduce the moment generating function, for 
example, 


w(O, t)= Sw, (te Dee eat) a et ere ox (12) 
1 i | 


Now for any finite ¢ we can reasonably assume that the probability of n 
events occurring tends to zero as n tends to infinity. Thus we may write 
symbolically 4 
w,(t)e : 

a li ere 0) 
w(G, t) 1 — Qe? ? ( ) 


Axo 
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so that w(@, t) satisfies an integral equation. We have discussed the solu- 
tion in terms of linear operators, since the whole development can be 
formally generalized to events of class r, provided the operators P and Q 
are .re-defined appropriately. 

We now particularize to events of class 1, and take Laplace transforms* 
int. The Laplace transform of any function will be denoted by a capital 
letter ; for example G(p) will be used for the Laplace transform of g(¢) 
defined in (6). The operators P, Q now have a very simple interpretation, 
Q corresponding to a multiplication by G(p), and P to a multiplication by 
[1—G(p)]/p. Equation (13) becomes 


wpe? ss W, (Pp) ge ee 


M0 PT Gp) — GG) 


and correspondingly, 
wy(p)[1—G(p)]_ 
ple~’—G(p)] 


To specify the solution completely we should now consider the initial 
conditions at t=0, and hence determine w,(t). But in fact, as we should 
expect, the asymptotic solution for sufficiently large ¢ is independent of 
w,(t) and depends only on the form of g(t). This asymptotic solution can 
be derived by expanding (15) as a series of residues (Domb 1948), 


U(6, p)= (15) 


uO, t)=2A,(9, them, of Ftp eB ke Meaney) 
where the m, are the roots of 
e7°—G(p) =U. to oe er 


The dominant term is given by the real root m, of (17), and one can readily 
obtain an expansion of this root in powers of 0, the coefficients of which are 
the asymptotic values of the cumulants of the w,,(t) distribution. We find 
that 


K(n?)—[B(n)? ~ (92/93 —19)t, 
where Ope ib t"g(t) dt 
0 


and is the rth moment of the g(t) distribution. Hence in the notation of §1, 


(19) 


* We depart here from the notation of Domb (1948) and define the Laplace 
transform of a function y(t) as ik y(tjeP' dt. This is now adopted by many 
writers, and leads to some simplication of formule, 
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It is important to observe that I’? depends only on the first two moments 
of the g(t) distribution... Thus g(t) could be replaced by any other distri- 
_ bution having the same first two moments without altering I; as a 
particular example g(t) could be formally replaced by an error distribution 
whose first two moments are g, and g,, although this is not feasible 
physically, since ¢ cannot be negative. Similar considerations apply to 
higher moments of the w,,(¢) and g(t) distributions, but these are not usually 
of physical interest. 

We shall now give a few practical examples to illustrate the evaluation 
of (19). 

(i) An «-particle counter of constant recovery time B . g(t) is the sum of 
two distributions, a fixed length 8 and a distribution Ae~“ du. 


g1=B+1/A, a=1/(1+A8), i (20) 
IJ2—Gi= 1)», M=1(1-+Agy, 
so that I” is always less than 1, as we should expect. 
(ii) Consider g(t)= a e-“; y=I1 corresponds to the uncorrelated 


case, 0<v<1 corresponds to an attractive correlation, and y>1 to a 
repulsive correlation. In this case G(p)=(A/p-+-A)’, and equation (17) can 
be solved exactly giving 


m,=—)+)e®”, ee ee eno) 


(There are no other roots of (17) in this case, the remainder being given by 
an integral round the branch point p=—A). We readily deduce that 


a==L/y, tT EN TE) Se oes Oceans TY 5) 


so that [<1 for repulsive correlation, and >1 for attractive correlation. 
(iii) Let g(t)=8(t)+(1—f)Ae; this means that there are finite 
probabilities 8, 8? .. . . of the simultaneous emission of 2, 3, events. Then 


PSU eet IRS Bien ee 09. A 28) 


This is a particular case of a more general problem considered by Whittaker 
(1938). 


§ 3. FoRMAL GENERALIZATION TO EVENTS OF CLASS’. 

We now suppose that if the intervals between the last r occurrences are 
E,, €o,-.. €)., the probability that an event occurs in the interval 
(7, n-+-dn] from the last occurrence is Af (£4, 2... & 13 9) (this is a 
conditional probability). The relations (5) to (13) can be formally 
generalized, many of them now assuming the form of conditional proba- 


bilities. Thus 
SS CR ote A) el al 


[ Fis. fai MIE [FG bras mae |. j aaa 
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and the probability h(é,,... €,-1; 7) that the interval following a given 
event lies between 7 and 4-++dy, when the previous (r—1) intervals are 
Si + ++ Fp Is 

Aflac en ye CGR a rat dn tee ee Od 


As before u,,(¢) can be divided into mutually exclusive classes 


WU Caso Sra) 


and /we.can) define, e.(t; “£), «1. £,.4) [=a (ts oa ee eee 
operators P and Q are now given by 


rc os) rt : 
Pw,,(¢; Fs ae éJ=| dé, dia a dE S| de iG otras » 


Xw,(t—n, EL cee aa 


CONCH oo allt) < Mn, Eye .2 £a) ey 


XW, (t— E19, €1 =» Sea) ON. 
(26) 


Equation (13) now furnishes an integral equation for w(6, €,,... €,—1); 
the operator Q being given by (26). 

Unfortunately no transformation seems to be available in the general 
case to provide the analogue of the Laplace transform solution for events of 
class 1. On physical grounds we might expect the solutions to have a 
fairly simple asymptotic character, and we shall find this borne out in the 
next section. 


§ 4. REFORMULATION IN TERMS OF LyTERVAL DISTRIBUTIONS. 


For event of class r any r adjacent intervals are correlated, but any 
intervals separated by + or more intervening intervals are uncorrelated. 
Let 1,,(€)dé be the probability that the sum of » adjacent intervals lies 
between € and €+d€. Then the probability that n events or more occur 
in a time ¢ after a given occurrence is given by 

“t 
qt=| t(f)dé (mS1). . . . . . (7) 


0 


Thus the probability that exactly n events occur is 


Prtt)=nlt)—dn41(8) (n>1). Q : es : (28) 
Hence the problem of determining the distribution of n is equivalent to 
determining f,,(€). 

The explicit evaluation of ¢,,(&) is complicated ; for large n we should 
expect t,,(€) to be normally distributed, and it might be possible to derive 
asymptotic results on this basis. | However, we shall find it more 
convenient to derive the moments of the t,,(&) distribution (which can be 
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done quite readily even for events of class 7) and deduce results by analogy 
with events of class 1, for-which an exact solution is available. For events 
of class 1, the cumulants of the ¢,,(€) distribution are simply n times the 
cumulants of the g(€) distribution, so that 


Ly Ng, 
(n)__,,(n)? 2 , ) 
My’ Py" =2(Jo—9})- 


We have seen that « and I depend only on the two moments g, and gs, and 
hence only on the two terms of (29). For events of class r let 


G( Sues wane dey unde; 


be the joint probability distribution of r adjacent intervals, and denote the 
mean value of any interval, E(gé,), by g,, and H(é, €;) by g,,; thus if 
ir, H(é, &)=g?. Then 


pMP=H(E+ ... +€,)=E(E,)+E(&)+ ...E(E,)=ng1 - (30) 
pk e 2) -F (= e425 zf,) 


=N944+2(M—1)9y94+-2(n—2)gig+ «.. 2(n—r+1)qi, 
{n?—n—2[(n—1)+ ...(n—r+l]1)}}gf. . . . (31) 


Hence 
pr —p"=nlgiit2Wiet -- - 2G1,-—2r—1gi] 


eee | a, 
ee ee Vili so 182 


The second term of (32) does not involve n, and since r is a finite number, 
it will become negligible for sufficiently large n; in fact, it represents some- 
thing in the nature of an end effect, and we should not expect it to influence 
the asymptotic distribution of the n’s. With the exception of this term 
there is a complete analogy between (20) and (32), and (29), and the 
problem of class 7 is thus equivalent to a problem of class 1, in which gp 
is replaced by 


Grr t2(G1e+ - - + G1) —(27—2) gH. i eer), 


Using the result (19) for events of class 1, we deduce that, for events of 
class 7, 


a=1/Agy, 
pee Pete giaatee 8 1p 32) 


| gi 
We may also observe that there would be no difficulty in deriving higher 
moments in a similar manner. 
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§ 5. THE CorRELATION FUNCTION. 


So far we have been dealing with asymptotic values for large intervals of 
time; we shall now consider an interval [0, ¢] of finitelength. Let usdivide the 
interval [0, f] into a large number N of sub-intervals of length 6¢, and denote 
the number of. events occurring in the 7th sub- interval, [t, t,+82], By dn,. 
Then, since we have assumed that the process is in a “ stationary ”’ con- 
dition, E (8n,) is independent of t,, and E (8n,;6n;) depends only on (¢;—¢;) 
Therefore 


j=n[ 20, |= 2 Ben) = ht dio dn a Cee 


so that this relation is the same for finite t. We now write 


where (pr) can be regarded as a correlation function, indicating the time 
during which correlation is appreciable. Then 


N 2 N N 
Bint) =H ( 2 bn) =5( Pah bn.) = & K(én,6n,), Bars 34) 
i=1 ij=1 ij=1 


and as N tends to oo, this can readily be transformed into 
t t 
2(Aa)? i (ir) + p(7)]dr=Aa2P+-2(Aax)? | (=r) pride eS 
0 0 
Hence if we define y(t) by the relation 
K(n?)—[H(n) P=y"adt, Pehla ree eee Cece) 


for a finite interval t, we shall have 


6 t 
Waal (3) oe ‘ (tc) p(7)das <7 5) ee ee 
(It should be noted that p(7) has a 6-function singularity at t=0 corre- 
sponding to 7= 7 in (36)). 

It is not difficult to see that, if the probability of two events occurring in 
an interval dt is 0(6¢7), y?(¢) must tend to unity ast > 0. On the other hand 
by definition y?(t) >I? as too, so that y*(t) passes from 1 to I as t goes 
from 0 to co. Letting to in (40), 


P=20x |” plr)dr. hi ie Poa aay 
0 


The above formule are well illustrated by the example considered 
previously of primary and secondary events separated by a fixed interval 
8, for which 


2Anp(7)=8(7)+5(7—B). . . 2 . . . (42) 


| It 8) 
Here, y(t) = 
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so that for <8, the events may be regarded as single and random, but for 
t>f they are to be regarded as doublets. 

We can readily make use of the correlation function to determine the 
correlation between any two finite intervals of lengths ¢, and t,, separated 
by an interval of length wu. If m refers to the number of events occurring in 
the first interval, and n to the number of events occurring in the second, 


M N 
E(mn)=E (2 a 5m) (2 Pa 5n,) PAs B(m,n,) 
i=1 j=1 


and this can Sieuaily be transformed into 
ty ty 

A202 | tata | tp(u+7)dr+t, siQeaake 
0 


ti +te 
safe. peepee) yolu-+rydr | (tt) panna) 


For events of class 1, we can obtain an explicit solution for the correla- 
tion function by means of Laplace transforms. We require first to specify 
the value of W,(p) in (15) to correspond to the “ stationary ”’ condition. ' 
This can most readily be determined by the condition that E(n) must be 
equal to A«t for any finite t; hence W,(p)=[1—G(p)]/g,p. (An alterna- 
tive derivation is given by Domb (1948)). We therefore deduce from (5) 
that the Laplace transform of E(n?) is 


1+G(p) 
gip{1—G(p)] 
Let R(p) be the Laplace transform of p(t). Then taking Laplace trans- 
forms in (38) and equating to (44) we have 


2(Ax)” a 2(Aa)PR(p) _ Aw 1+G(p)] 
p> p* p*[1—G(p)] 
le Cpa yA ae ees eee 
Se ee et te (48 
: R(p)= xyf1—Gip)]  p  2Aa pp * def] —G)I oe) 
The first term of the final expression in (45) corresponds to the 6-function 
singularity at <=0. When there is no correlation, so that G(p)=A/p-+-A 
the remaining terms in (45) vanish, as expected. 
We now apply (45) to some particular cases considered previously in 
§2. 
(i) For an «-particle counter of constant recovery time f 
G(p)=AeP/(p+A) and = a=1/(1+a). 
1 (1+fBaAje*? 1 


Therefore R(p)— Da = p-LA— neo? = Pp “ : : S 5 (46) 


(44) 


and this can be expanded in the form 


(1+frje#?  A(1+-BA)e*P 
+e ee ieee AE gins, or ee AC SH 
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Thus we have an explicit expression for p(¢) in the form 


il 
p(t)— oO) 


242 
= 140+ pA) | eM tae M+ Sheet a (t,=t—rr), . (48) 


where, if (r—1)r<t<rrr, r terms are to be taken into account. 

It is also of interest to consider the asymptotic behaviour of p(¢), and this 
can readily be obtained from the poles of (46). There is no pole at p—0, 
and hence the dominating term in an asymptotic series such as (16) is given 
by the real root, —n, of 


pta—rve~ ”=0. 
(L+BA)L—a/A) it 
{jo eh Se ee) 
agi rO~ "T+ B0—a) 
(ii) Let 
ij== se aphet: so that G(p)= oa and gy=v/r 
g(t)= Tey: > P)= ppn 1 : 
v vrr—-t ey 
See t— eee C0 
We have R(p) Sn (Ae ee (50) 
When v is an integer, 7, this can be readily evaluated explicitly as a sum of 
exponentials, the poles being given by p/A=—1-+-e”*""; for example when 
DPA, 
p(t)— ay se Sg) bh aS ee CAEL EF 


The asymptotic form of p(¢) for general v cannot be obtained as readily 
as in the previous example, since branch line integrals must be considered. 


§ 6. THE GENERALIZATION OF CAMPBELL’S THEOREM. 


_ For purely random events Campbell considered the distribution of 
amplitude in an instrument, when the effect of each event could be repre- 
sented by an amplitude s(t) after time ¢. We shall now consider the general- 
ization of his results to correlated events. Let y be the amplitude at the 
time of observation, and let us measure time t backwards from this point ; 
again we divide our time interval into a large number of small sub-intervals 
[t,, ¢,-+6¢], and 


Yc Lil 2; On; 8(t,). ae sc an 
ot—>0 7% 
Therefore E(y)= Lt SE(8n,)s(t) =o. [ so Het wk uceeee, 
ot—>0 7% 0 
and K(y’)= Lt YY K(sn,5n,)s(t,)s(t,). (54) 


ot—0 7 
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Substituting from (36) and proceeding as before, this double integral can be 
transformed into - 


ioe) 


cra] J sty dt | +200) [aap | s(t)s(t+-7)dt . . (55) 


0 
and thus 
By) —[LEW)P=202)* | p(r)de | s(s(t-+r)dt. (56) 
0 0 
This may be regarded as a generalized form of Campbell’s theorem, the 


6-function term in p(r) giving rise to the usual term | s(t)dt. As a 


. . . . . © 
particular example, if p(t) is only significant during a time for which s(t) 
does not change appreciably, the second integral in (56) can be replaced by 


| s*(t)dt, and using (41) we have 
0 
E(y2)—[E(y) P= acl? [ wae Boe el OEE 
0 


This is the usual form of the theorem for the shot effect in the presence of 
‘space charge. 

If we now take s(t) to correspond to a sharply tuned circuit at frequency 
w/27 E(y?) will correspond to a “power spectrum ’”’, and we obtain the 
standard formula 


Plw)= |" 2p(7) cos wr dr. tee pay t  Se(DS) 


This is especially useful in considering events of class 1, since R(p) 
has been evaluated explicitly in (45), and 


P(w)=R(tw)+R(—tw), . . . . « . (59) 


As a particular case, for the «-particle counter we deduce from (46) that 


wW 
I—cos Bw+- sin Bw 
P(w)= ae 1— A at (00) 
1—cos Bw +> sin Bw + w?/2A2 


It is interesting to note that as A tends to oo, this tends to zero for all 
frequencies other than multiples of 27/8, which is in accordance with what 
we would expect physically. 

A rather interesting example is that of primary and secondary events 
separated by a fixed interval 8. From (42) we readily deduce that 


P(w)=, cost. ee rian 22 (OL) 


980 ©. Domb on the 


§7. THe PartITIONING OF EVENTS 


We now consider the effect of the random partitioning of correlated 
events into two groups. This problem has a physical application to valves 
containing two or more electrodes, and gives rise to the phenomenon of 
‘partition noise.” Let «, and «, be the probabilities that any given event 
falls into groups 1 or 2 respectively (z,+-%,=1). Then the probability of 
an occurrence of n, events in group | and 7, in group 2 in an interval [0, ¢] is 

. n | Ny pn, Ne, if (62) 
P(N, No 5 Desa al Lea Wig tgs a> kre eae 
where n=n,+-n, and u,(t) is the probability of n events occurring in (0, ¢]. 
We have seen from §’s 2, 3 that w,,(¢) may be formally written as PQ” 1w,(t) 
where P and Q are certain linear operators. The joint moment generating 
function of n, and n, is given by 
(91, A; = Z p(y, na; thee tmh 


NyNo 


! 
a MA ee DH Sed cia (a, Qe) (a.Qe")”*w , (t) 
NyNo N+ Ng: 
= PL (a+ x.€%)"Q” tw, (é). Nah eiberintcry 5 O72 55) 


This is identical in form with the moment generating function for n except 
that e’ is replaced by «,e%-+-«,e%. Hence if we know the expansion of the 
generating function for 2 we can immediately determine the corresponding 
expansion for n, and n, by replacing 6 by 


In [1-04 (e%—1)+-a9(e%—1)]. 


In particular if we take into account only two terms of the cumulant 
generating function of n, we have 


In u(8, t) ~ Aat(O+T707/2+...) 2. . . . (64) 


and the corresponding joint function is 
62 
In ¥/(0;, 02; t)=Aat | eB +b. % (%1%9-+ 041) 


02 
ye 7 (4142051) + 01 4%90,85(L?—1) ae i. . (65) 
Thus, if we confine our attention to events of group 1, 
Pi FO gS el eae (66) 
and we may deduce the relation 


E(nn2)—E(n,)E(n9) ; 
(iia, fay) = Vt%0)(*—1), Sy Seer loT) 


These latter results are standard in the theory of partition noise. 
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It has been pointed out to the author by Professor Pryce that all the 
above results can be deduced quite generally by an elementary process of 
averaging, since the partitioning process is independent of the correlating 
mechanism. In fact this is effectively the method used by Bakker (1938) to 


- deduce the formula (66) and (67)*. 


We now consider the interval distribution of partitioned events. We 
first observe that if the original series consists of events of class 7, the parti- 
tioned series does likewise, since intervals separated by r intervening 
intervals remain u correlated. For events of class | the moment generating 
function of the partitioned series can readily be written down. The pro- 
blem is then to determine the distribution of €,+€,+ ...&,, where the 
és are independently distributed with probability g(é)d€, and r has distri- 
bution «,«3°'. This is a problem arising in many different fields 
(Kendall 1949) and the solution has been given independently by a number 
of writers (e.g. Woodward 1948). The moment generating function 
of the moment r’s is «,/(e~°—«,.), and that of the é’s is G(—6@). Hence the 
generating function G® (—4#) of the partitioned series is given by 


Car eee Male b) lets. ee a ee (OS) 


The first two moments can easily be determined from (68) 


GP =gies ] 


Sc ae OR tem re i 
2 J2—9J1)/%1—9 7 %9/ 03, 


and we verify from (19) that (66) is satisfied. 
For events of class r, although we cannot write down the complete 
generating function, we can determine the first and second moments. 


Thus 
Pr EB Eres to: fe) =Brgy) gilt, SRR ey ih 6 getter ALO) 


guy =EE (E+ bet ee eH IG ia eT — Gin tt — 2 gist al 
i ; 
2s gi, Wart Poagiat 205919 + ete PN ck ES era a 8) 


The correlations can be derived similarly, although the detailed evalua- 
tion of the various terms is rather cumbersome. We obtain 


2 s(s-+1) 
fea *| gb saf ern Fy 8 us-+-2)+ De (6>2) ena) 


the coefficients of the various terms being those in the expansion of (l—a) *. 


* T am indebted to Dr. D. K. C. MacDonald for drawing my attention to this 
paper. 
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The series (71) (72) are not very convenient for manipulation, since the 
gi, remain finite for large s. It is more convenient to take away the mean 
values, and put the results in the form 


1 
oo? Batt 2 | ug) +20alde— 424.098) + - «|. 78) 
1 


- g(s-+1) 
Ve—-N=ay” | (asa) -+843(9 699) + a1 Gue+9—91)+ - - | : 


We can then easily verify, on substituting in (34), that the result (66) is 
obtained for the partitioned series, and this provides a useful check on the 
internal consistency of the formule derived. 
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SUMMARY. 


The author’s equation for the di-iso-electronic row enables us to compute 
by extrapolation the electron affinity of hydrogen, Ey=—0-:74eV. The 
author’s equation is nearly identical with that of Hylleraas, but differs in 
an important term from that of EKucken. A relation is given between 
screening number and effective charge on one hand, and nuclear charge 
number Z and number of electrons in the isorow, 7, on the other. 
Numerical values of the constants required to calculate screening numbers 
S,in the K, L and M—-s isorows are givenin Table I. It appears that with 
increasing Z, 8; approaches half the coefficient of the linear member of 
equation (10), 7. e., 8; ~4(20—1—@’) ; when Z>20, the screening constant 
for the di-iso-electronic row, 8,;=0-631, and that for the tri-iso-row, 
Sy7=1-63 and that for the ennea-isorow §,x=6-01, or Sjx=6-40 


respectively. 


IN a previous paper on the K and L electrons (Richter 1948), the ionization 
potentials of iso-electronic atomic systems were expressed in a quadratic 
function of the variable Z+-1—7, where Z is the charge number of the 
nucleus and 7 the number of electrons surrounding it ; the coefficients in 
the expression are functions of 7 only. The present work relates this 
method to the usual expression in terms of screening constants. 

From the known experimental data we have derived the following 
expression for the ionization potentials J in a given iso-electronic row 


J,=4B(Z-+1—1)-+ (A,—$B)(Z+1—1)+D,= (Z—8,)PR/n’, 


(1) 
where B,, A, and D, represent the quantum, row and correction constants 
respectively of the given row. Values of these constants in volts for the 
K and Liso-electronic rows are given in Table III. of the above-mentioned - 
publication. Since B; is very nearly the same for both K rows, and again 
for all L rows, we may take average values By, By, and the reduced 
constants corresponding to this choice are given in the same table ; these 
reduced constants are distinguished by a dash attached to the symbol. 
Usually the values of the individual and reduced constants differ little, 


é. g. By =27-07, Ayn 23:72, Dy,=0-74, 
Bee B21 eA 23:57, Dy 1:02. 


* Communicated by the Author, 
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The right-hand member of equations (1), where R denotes Rydberg’s 
constant and n the principal quantum number, serves to evaluate the 
- screening number §S,, or the effective charge. On inserting the individual 
constants for i=2 in equation (1) we obtain the equation for the di-iso- 
electronic row : 


Jyy=13-535(Z—1)? + 10-17(Z—1)+0-74=R(Z—S8y).. . (2) 


It is worth noting that 10-17=3R/4, corresponding to.the K—« line of 
the X-ray spectra, and further that extrapolation to Z=1 gives Ey—0-74eV. 
for the electron affinity of hydrogen. This value is identical with that 
determined by C. A. Hylleraas (Kucken 1938). 

The difference between the di-isoelectronic formula and the quadratic 
hydrogen term is directly connected with the deviation of the screening 
number §, from the value corresponding to ‘‘ complete”’ screening, namely 7. 
The screening number for the di-isoelectronic row derived from equation 
(2) is then given by 


Syp=Z—[Z?—1-263Z-+0-317}#=Z—Z"*=Z—Z, . . . (8) 
or, using the reduced constants : 
Sj, =Z—[Z?—1-260Z+-0-334]}=Z—Z"*=Z—Z,,. . . (3.2) 
Then for Z=>20 we have the simple result : 
S,;=0-631 and the Bue charge [Zi,?=Z—0-631. . . (4) 


Two other equations have been elaborated for the di-isoelectronic row, 
one by C. A. Hyllerass (Eucken 1938) on wave-mechanical grounds : 


J yp= R[Z?— 1-250Z+0-31488—0-01752Z-1+.0-00548Z-2]— RZ*, 
(5) 
and the other by Eucken (1938) : 
de RIZ 12507]. 4 See ee 16) 


It is clear that expressions (2) and (4) differ very little even for small 
Z, Hylleraas’ result giving S;;~0-625. The strange-looking symbol 
V Zi; is introduced to accord with the notation of Hylleraas and Hartree 
(Kucken 1938). 


The analogous expressions for the tri-iso-electronic row are as follows : 
Jyp=3-40(Z—2)?-+-2-54(Z—2)—0-38 os 2. (7) 

and 
Spy=Z—[Z?—3-254Z 42-4018 . 2. 1... (8) 
It may be mentioned that the value 2-54 in equation (7) is 3R/16, corre- 


sponding to the L—£ line of the X spectra. As in the previous case, S 
becomes practically constant for Z>20, 
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Our picture of the dependence of the screening number S, on Z and i 
becomes more intricate (Richter and Krestynové 1947) whenever the 
quantum constant of the iso-row in question differs from that of the hydrogen 
L level, 7. e. R/2=6-80. The simple picture may nevertheless be retained, 
substituting the value for the hydrogen L level either for the individual 
quantum constant of the L isoelectronic set under examination or for the 
reduced constant B;(=6-90). No very significant error is so introduced, 
the mean error of the reduced quantum constant (c=0-033) being less 
than that of any individual quantum constant in the L isorows with the 
sole exception of the tri-isoelectronic quantum constant with its c—0-019. 
To obtain the dependence in a simple algebraic form, we spline quantities 
G;, d,, a, di, as follows : 


G,=2A,/B,, d;=2D,/B, a@=2A//Bi, dj=2D/Bi. . . (9) 


Then the relations between Z and 7 on one hand and §, or Z? on the other 
are expressed by these equations : 


§,=Z—[Z?—(i—a,+i—1)Z+ (i—@) (6-1) +d} =Z— 24, 
(10) 
LY fej Sa ea (ti VZ+(i—a Mi—1) +d =Z— ZF. 
(10.2) 
The calculated values of G;, d,, a; di necessary to compute §; and Z¥ in 


the first twelve iso-electronic rows are given in Table I. On inserting 
the appropriate values we can write, ¢. g. 


Tope T= 194)997,-8 99-35 ee es. (1) 
and  _ Z;y =[Z?—12-029-+ 32-35]! =Z[1 —12-029Z-14 32-3522]. 
If ont % | 
| —12-029Z-14.32-35Z-2| =a,,, then Zj,;=Z(1taj,x)#. (11.2) 
Under the supposition - 
—l<a<l 
only the first two terms of Taylor’s series are sufficient to determine the 
practically constant value of the screening power in the isorow inquired 
into, 7.€. Sx. 
Z(1+- ax) =Z(1+dayx—$eix— - .. = Sl1+327x+Reo(ajx)]. 
(11-3) 
The error caused by neglecting the remainder R,(a;x) of Taylor’s series: 
can be easily approximated by the inegality 


io eae 1 |x? 
R eo ah ER 8 See ae (Le 
(<3 Va] 21—|x| ey 


giving a more and less approximation. 
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As x depends on Z what we express by a, it is evident that 


| Xx, 20 | > | @1x, 40 | > | %rx, 80 | + 


—12-029 . 32-35 
As | Mx,20]= POTS 400 =0-68<l, . 


the formula (11-3) can be limited to the first two terms only without 
committing a significant error in evaluating S;x for Z2>20. 


Therefore Zrx=Z(1—6-01Z-4) and Sjx=—6-01. . . . (114) 
The evaluated error accounts to 
[Races <0-18< 072s. 0 secant 


In the table of O. Krestynova (4) the values for Z=20, Syx 99>=6-15 and 
for Z=90, Srx, 99=6-04 are to be found. 


TABLE I. 


Constants serving to calculate the screening numbers and the effective 
charges in the K and L isoelectronic rows according to the formula : 


§,=Z—[Z2—(i—a +i—1)Z+ (tH) (¢—1) +d, P= Z—Z*#. 


The numbers with three decimals denote the constants of these 
isorows (<=2, 3, 8 and 9) the empirical quantum numbers of which 
agree with the hydrogen term exactly. 


Isorow 
a 


0-075 
—0-06 


K 
L 


0-44. ; ; 0:26 
0-15 

—0-08 ° Ss 0-01 
0-73 . . 0-33 


7 | —0-435] 4:16 | 3-84 0-17 
| | 4. 8 0-118| 4-81 | 4-19 0-25 
10 | 5-46" 44-54 9 0-76 | 5:45 | 4-55 0:86 
0 | —0-79 1 as 
L 4 0-36 as aa 


The table contains in addition to values for the K and L isorows 
those for the M—s rows. 


eae 8 yee 
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The quantum constant for the M—s isoelectronic. rows, determined 
from the available data for J, is 3-19, with mean error c=0-04, and is thus 
6 per cent higher than the value of the hydrogen M term. This theoretical 
difficulty is increased by the fact that the quantum constant of the group 
M V seems to be clearly different from the constants of the groups M I to 
M IV ; this opinion has been based on the study of the iso-electronic row 
of the 10th d electron, not only on the M level but also on N (Richter 1947). 

From equation (10) it is obvious that the screening power S, varies with 
both Z and 7, but that for given 7 it becomes practically constant for all 
sufficiently large Z ; its asymptotic value approaches half the coefficient 
of the linear term, 20—1—a;. Since the inter-relations of these iso- 
electronic equations with the X-ray spectra on one hand, and the screening 
numbers of A. Sommerfeld, N. Bohr, L. Pauling (Sommerfeld 1931) and 
J.C. Slater (Slater 1930) on the other, are not entirely elucidated, detailed 
discussion is reserved for a further paper. 
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§1. INTRODUCTION. 


Improvep methods for calculating the laminar boundary layer flow for a 
cylindrical body were described in an earlier paper, Piercy, Whitehead and. 
Tyler (1948). The purpose of the present paper is to show how these 
methods can be extended to the solution of the laminar boundary layer 
equations for the axial flow past a solid of revolution. This problem, 
which has received less attention than the two-dimensional case, has been 
examined previously by Millikan (1932) and by Tomotika (1935). In both 
of these papers a momentum integral relationship similar to von Karman’s 
two-dimensional equation has been employed to determine the boundary 
layer thickness. Millikan has assumed a quadratic velocity distribution 
through the boundary layer, which can give only a rough representation of 
the flow, while Tomotika has achieved somewhat greater accuracy by 
employing Pohlhausen’s quartic velocity profile. 

In the present paper the boundary layer equations for the flow past a 
solid of revolution are derived in a form similar to that employed in 
Piercy et al. (1948) for two-dimensional flow. Solutions of these equations 
are found for the flow past cones and are shown to be related to the corre- 
sponding two-dimensional flows past wedges given by Falkner and 
Skan (1930) and Hartree (1937). An approximate solution of general 
applicability is also presented, which is similar to the two-dimensional 
approximate solution described in section II. of Piercy e¢ al. (1948). A 
velocity distribution involving three terms is used and these are chosen to 
give accurate velocity profiles in certain standard cases. Satisfactory 
results are achieved for the whole of the accelerated part of the boundary 
layer of slender bodies and for the earlier part of the retarded flow. Except 
for bluff bodies, a more elaborate solution employing four terms for the 
velocity profile would be required for the accurate determination of the 
laminar separation point. At practical Reynolds numbers, however, 
transition to turbulence will usually occur before the solution using three 
terms ceases to apply. 

The approximate solution, which is exact in the vicinity of the stagna- 
tion point of a body with a rounded nose, is compared with the solutions. 
for the flow past a cone and satisfactory agreement is found. The bound- 
ary layer flow has also been calculated for a sphere with theoretical and 


*Communicated by Prof. N. A. V. Piercy. 
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experimental pressure distributions and the results are compared with 
Tomotika’s solutions. Inthese examples the present method is applicable 
to the whole laminar boundary layer up to the separation point. 

As a final example the flow past an ellipsoid with a fineness ratio of 3 is 
considered. The boundary layer flow corresponding to the experimental 
pressure distribution is calculated and a satisfactory solution is obtained 
over the forward 80 per cent of the laminar layer. The method fails, 
however, to give a solution close to the laminar separation point. 


§2. Notation. 


The notation used is similar to that employed in the two-dimensional 
solutions described in Piercy e¢ al. (1948) but due to the different nature of 
the problem the corresponding functions are defined differently in some 
cases. 


A,B,C. functions of € appearing in the series and approximate 
solutions of the boundary layer equations. 

irrotational flow velocity. . 

velocity ratio h’/U. 

functions defining the curvilinear coordinate system. 

a parameter defining the approximate velocity profile. 

a representative length of the solid of revolution. 

degree of the Legendre function P,,(). 

static pressure. 

component velocity ratios. 

spherical polar coordinates. 

Reynolds number. 

velocity of the undisturbed flow. 

velocity vector. 

cylindrical coordinates, x measured along the axis of symmetry 
and y perpendicular to it. 

velocity potential of flow with axial symmetry. 

Stokes’ stream function. 

a’/Ul and p’/Ul? respectively. 

boundary layer displacement thickness. 

boundary layer momentum thickness. 

functions of 7. 

cos 6). 

kinematic viscosity. 

skin friction. 

coordinates defined in terms of « and f. 

vorticity. 
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§3. Bounpary LAayER EQuaATIONS. 
The exact ‘equations for the incompressible viscous flow with axial 
symmetry past a solid of revolution are first derived in a modified form, 
An orthogonal system of independent variables «’, 8’ and y’ are employed. 
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«’—=constant defines an equipotential and ~’=constant a streamline for the 
inviscid axisymmetric flow past the body while y’ is constant on a plane 
passing through the axis of symmetry. 4’ is the resultant velocity of the 
potential flow and q’ and w’ the velocity components of the viscous flow in 
directions perpendicular to surfaces a=constant and f=constant 
respectively. The velocity ratios g=q'/h’, w=w'/h’ and h=h'/U are 
employed in the equations and «’ and f’ are converted into the non-dimen- 
sional forms « and f by dividing by the undisturbed velocity U and by a 
power of a representative length /. Hence 


eo /UL, B=p'/UP 
and the Reynolds number for the flow is defined as 
R=Ul/v. 
The vector form of the equation of motion for steady flow is 
—Vx¢C=— grad (p/p+4V?)-+v (grad. div V—curlZ), . . . (1) 


where V is the velocity vector and ¢ the vorticity. The corresponding 
form of the equation of continuity for incompressible flow is 


div V0 sh: dat Zi tine eee 


The transformation of these equations into curvilinear coordinates is 
readily carried out. General expressions for the components of the various 
terms in the two equations are given by Milne-Thomson (1937) in terms of 
three factors h,, hz, and hs. The element of length along a line perpendi- _ 
cular to the surface «constant is expressed as h,5« and the corresponding 
values in the two other directions are h,d8 and h,dy respectively. It 
follows therefore that for the present system of orthogonal coordinates 


hy=1/h, h,=Il/yh and h,=y, 


where y is the radial distance from the axis of symmetry divided by the 
representative length /. Substitution in (1) yields the equations 
1 oh oq 


5 oq 
ee ae 2 Nek ir. Abe se 
(rma re foe, Ser rrr 


ee Oa Ce a ESPON Salad 
SUR aa + RR E a an ap (ya) }— ae ie (=) 3 ai 


(3) 


and 
LiCl ane dw Ow qu oy 
i ap +o 14 = ied ao ete eae 
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A third equation does not arise as there is no flow normal to the planes 
y=constant. The equation of continuity takes the form 


Oy | A(yw) 
du ' op 
Equations (3) and (4) are simplified by applying the usual assumptions 
appropriate to flow with a thin viscous boundary layer. In addition the 
change of y through the boundary layer is neglected so that it can be 
replaced by yo, its value on the boundary. This latter assumption will 
introduce small local errors close to the stagnation point on a solid of 
revolution with a conical nose since in this region y is comparable with the 
boundary layer thickness. This does not, however, affect the validity of 
the solution over the major part of the body. With these approximations 
equation (4) reduces to 


eM Bh te a talioe | (8) 


Lah yp 
5 OB pUPA OB’ 
so that the change of pressure through the boundary layer is negligible. 
Hence the value of 0p/d« in (3) can be calculated from the irrotational flow 
just outside the boundary layer, giving 


Op Shan 
axa (" ape 


Equation (3) therefore reduces to 
Og _ I dh yp 0q 
ee Sey eee eer (G 


This equation is to be solved in conjunction with the equation of continuity 
(5) for the boundary conditions y=w=0 on the solid of revolution, which 
forms part of the streamline B=0, and g=1 “or large values of f. 

New variables € and 7 are now introduced in order to simplify the 
boundary conditions. They are defined in terms of « and B by 


fs Raat .."- - 
Sire ae Sage — Oa ed oe! se ae) 


where «, is the value of « at the front stagnation point. The part of the 
streamline B=0 lying downstream of the stagnation point corresponds to 
&=0 so that the boundary conditions are taken as gq=0 for €=0 and q=1 
for large values of €. With this change of variables (6) becomes, after 
substituting from (5) for w 
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an equation which differs only from the two-dimensional form by the 
inclusion in the second term on the left hand side of the additional factor 
4n|y, (dyo/dn). A function ¢ employed in the two-dimensional solution is 
reintroduced here together with an additional function y. They are defined 


by 3 
Qn dh [1- 2n 7... 
=2)/1——> — and p=2 
h ae Yo Mm Ns-0 
Both ¢ and ¢ are functions of 7 only and are zero at the stagnation point of 
a body with a rounded nose. Equation (8) therefore becomes finally 


aq aq fé og (* og 
Des Milan dees qdé= (¢— AjGl = +4 -q— OE: 9 On (9) 


§4. FLow Past CongEs. 


General solutions of the boundary layer equation (9) give a velocity 
profile which varies along the surface, but when ¢ and ¢ are both constant 
a solution for q is found which is independent of 7. It is readily shown that 
the flow past a cone is of this type. The velocity potential « is expressed 
generally in terms of Legendre functions by the relation 


a=r"P,(H) » 
where »=cos 6, and r and 6, are spherical polar coordinates. 6, the value 


of 0, on the conical boundary, is found from the condition 0«/d@,=—0 and 
hence is given by the solution of the equation 


d 
du [P,.(u)] =? 


4, is the external angle between the axis and the wall of the cone, it has the 
value 7/2 for n=2 giving the flow up to a stagnation point. When n=1, 
§>=7 and a cone of vanishing thickness results. Values of 6, for inter- 
mediate values of n, between these two limits, are plotted against 7/n in 
fig. 1. On the conical boundary, 7 and y, are given therefore by 
n=r"P,, (cos O) and y=rsin 6. 

Hence 

g=—p=4in—2 we 


With these values of ¢ and ¢% substituted in (9), the equation for q, as a 
function of € only, takes the form 


at (2+5)5¢ |, ade=4(1—2)(@t—0, 1s, ALD 


This equation is transformed by the further substitution €,=€(2+4/n)-4 
into 


o"q , og 2(n—1) 
and if z’=q nee es Ae inte it becomes identical with the equation 
Rite ca \(L>2?)=-0... Se ee eee (12) 
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which has been solved by Hartree for the corresponding two-dimensional 
| problem of the flow past a wedge. Hartree gives accurate numerical 
values for z for a series of values of the parameter A. The flow up to the 
stagnation point on a plane wall (n=2) corresponds to A=+4 so that the 
velocity profile will differ appreciably from that near a two-dimensional 
stagnation point for which A=1. The solution is not valid for values of n 
close to unity for the boundary layer thickness on the slender cones which 
occur in this range is comparable with yp. 


Mion 


5 


2 
Variation of cone angle with n. 


§5, SERIES SOLUTION FOR 4 Bopy WITH A ROUNDED NOosE. 

A series solution starting from the front stagnation point can be con- 
structed in a manner similar to that described in Piercy et al. (1948) but, 
due to the presence of the additional function y, it will involve, for equal 
accuracy, the use of a larger number of terms. An expansion for q is 
assumed in the form 


g=A+Bd+C¢?+"..:.,- . - « « . > (18) 
and each function B, C, etc. is further expanded into series involving 
functions of € multiplied by powers and products of the coefficients in the 
expansions 

dd 2 3 

cays a +cd?+ ...% , 
and 


p=a,b+b 4? +¢,¢°. 
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Only the function A and the term in B which is independent of these — 
coefficients have been examined as they are required in the approximate 
solution given later. They satisfy the equations 


é 
A’4+4A! | Adé—2(A2—1)=0, 
0 


and 


r& é 
Bit 4A’ | _ Bylé-+4B, | " Adg—4AB)=1—A®, 


The boundary conditions for A are A=0 for €=0 and A=1 for € large. 
The equation for A is identical with (11) if n is put equal to 2 so that the 
solution is given in Hartree’s paper. For B, the boundary conditions are 
B,=0 at €=0 and B,=0 for é large. The equation for By has been solved 
by a method of successive approximations and its values together with A 
are given in Table I. The second derivatives for all the further functions 
vanish at €=0 but otherwise the boundary conditions are identical with 
those of By. 


§6. THE APPROXIMATE SOLUTION. 

An approximate solution which can be employed over the greater part of 
a laminar boundary layer is next considered. An expression for q is 
assumed which contains three terms. The first two of these are the initial 
terms of the series solution and a third term, multiplied by a variable 
parameter, is used to represent approximately the remaining terms of the 
series. This type of velocity distribution has given good results for two- 
dimensional flow and is justified by the fact that the third term replaces 
terms which have the same boundary conditions and vary in a similar 
manner throughout the range. Therefore ¢ is expressed as 


GA Bed K -f(é), SL ee Le) 


where the parameter K is a function of 7 only, and is determined from a 
momentum integral relationship. A more elaborate solution using a 
fourth term and two integral relationships would give more accurate 
results in the retarded region, as is shown in Piercy et al. (1948) but the 
single term used here is adequate for accelerated flow. K is chosen 
arbitrarily as unity for the flow past a cylindrical tube with its axis lying 
along the stream. For this case equation (10) reduces to the two-dimen- 
sional form for the flat plate since, as y) and h are constant, 6B==2. Thus 
the velocity profile is given by the well known Blasius solution for zero 
pressure gradient. (€) is chosen therefore so that q takes this form for 
f=2 and K=1, its values are given in Table I. 

The appropriate form of the momentum integral relationship is obtained 
by integrating (9) through the boundary layer, leading to 


(3). ee "| - (q—q2) dé-+(2— | (l—q *)dE-+4y 7 {* (Gade. 


(15) 
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This equation may be expressed in terms of the two functions 
=| (q—a)dg and 5={° A—gyae, 
0 0 


which are related to the momentum thickness 6 and the displacement 
thickness 5 of the boundary layer respectively, by 


i 7-(z) @ and r=a(8) Ones aoe eet CLOY, 
TABLE I. 

5 B- —By f(é) 
0-0 0-0 0-0 0-0 
0-2 0-3311 0-0510 0-0963 
0-4 0-5834 0-0674- 0-1839 
0-6 0-7615 0:0605 0:2467 
0: 0-8760 0:04:32 0-2728 
1-0 0-9421 0-0257 0-2609 
1-2 0-9760 0-0130 0-2210 
1-4 0-9913 0:0056 0-1686 
1-6 0:9974 0:0020 0-1173 
1:8 0-9993 0-0006 0:0748 
2-0 0-9999 0-0002 0-04.40 
2-2 1-0000 0-:0000 0:0241 
2-4 : ‘ 0:0122 
2-6 . é 0-0058 
2-8 : E 0-0025 
3-0 ’ i 0-:0010 
3-2 1-0000 0-0000 0-0004- 


Equation (15) therefore takes the form 
at) dé = = 
ce Ved eel (0 ed Oe (2 — dont 2 SLT 
(3) ._.= 41g, + Od WI 0-8) (17) 


and the numerical values of 0, 5 and (@q/0€) s<o appropriate to the chosen 
velocity distribution are 


0=0-°1752 +0-01926 —0-0027¢? -+0-2425K , 
20-0210p Ko —0-0602 Ky) Ph it a (18) 
POA n 0-0 bomen 0-348 1 Ke. on ee a. (19) 
(8q/8é)¢-p=1-8554 —0-35236 —0-4868K, . . . . . « « (20) 


The non-dimensional skin friction coefficient 7R}/}pU? is expressed in 
terms of (0q/0€)-_) by the relation 


aes = (3) 
IpU? 7 \0E]e—01 


996 L. G. Whitehead and G. S. Canetti on the Lamunar 


A differential equation for K is obtained by substituting these numerical 
values in (17) and it is solved for each example by a step by step method of 
integration. The equation is 
4n dK |dn (0:2425—0-02194 —0-1384K) 
—0-0904¢2 —2-6379K +0-4150K? +0-7221¢K 
—4y ddjdn (0:0192 —0-0054¢ —0-0219K) 
+4 (0-1752 +0-01926 +0-2425K) 
—(h+rb) (0-0027¢? +0-02196K +0-0692K?). 

A first check on the accuracy of the approximate solution is made by a 
comparison with the accurate solutions for the flow past a cone found in 
§4. For these examples, as ¢ and % are constant, K is also constant and 
equation (21) reduces to the simple quadratic for K 
1-9184 1-1392  1-8400 


; + 0-6352 = 0. 
n n 


0-4150K? + ( — 39971) K+ 


Values of K for a range of n between 1 and 2 are given in Table II. and the 
approximate values of (0q/0£);_) are compared with values found by 
Tomotika’s method and with the accurate values obtained by Hartree. 
The accuracy for the present approximate solution is seen to be superior 
to that of Tomotika’s solution, the maximum error for the range considered 
being 0-6 per cent compared with 1-8 per cent. 


TaBLeE II. 

og oq og 
r n K = — =< 

Cee le 

en Tomotika Hartree 
0-1 1-159 —0-0529 1-369 1-380 1-371 
0-2 1-333 —0:0477 1-526 1-525 1-536 
0:3 1-530 —0-:0343 1-655 1-639 1-665 
0-4. 1-750 == ()-() a ae 1-763 1-736 1-768 


0-5 2-000 0-0000 1-855 1-832 1-855 


§7. Tae Laminar Bounpary Layer on A SPHERE. 


The boundary layer flow on a sphere has been calculated previously by 
Tomotika for the potential flow pressure distribution and by Tomotika and 
Imai (1938) for an experimental pressure distribution measured by Fage 
(1936). Solutions for these examples have been calculated by the present 
method and are compared with these earlier results. 

For the irrotational flow past the sphere the velocity near the boundary 
is given by h=} sin 6, and, if the representative length is taken as the 
diameter, = }(1—cos 6,) and y,=4 sin 6,. The corresponding values of 
$, y and y (d¢/dy) are therefore 


ead ga 2(1—cos 6,) d@ 4(1—cos 6,) 


1--cos 60 1 dn (Fcos Cres 
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and it is convenient to change the variable 7 in equation (21) to 0, for the 
step by step integration. A series solution for small values of 6, shows 
that only even powers arise and the starting value of K is given by 


K=0-01365 62+ ..... 


The skin friction coefficient +R*/4pU? is plotted as the continuous line in 
fig. 2 and the dotted line shows Tomotika’s solution. The separation 


Fig. 2. 


— Present method 
Tomotika 


Skin friction on, sphere, theoretical pressure distribution. 


point is found to be 107° from the front stagnation point compared with 
108° for his solution. 

Tomotika and Imai have expressed the experimental velocity distri- 
bution on a sphere measured by Fage for an entirely laminar boundary 
layer as 

h=1+5 0,—0-43707 63-+-0-148097 03—0-042329 67, 
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and this form is retained for the calculation of ¢ and ¢ in the present solu- 
tion, though it differs slightly from the measured values in the immediate 
vicinity of the separation point. ¢, 4 and y (d¢/dm) are calculated from the 
relations 


=| hd, 
a 4n dh 
p=2 yt h2 dé, ? 


4 
p=2 — = . cot O,, 


dp 4m [2n(dh\?__ (dh 38 
1-H d0,)  \d0,) hdo2f- 


Fig. 3. 


and 


Present method 
Tomotika and Imai 
Skin friction on sphere, experimental pressure distribution. 


The inital values for the step by step integration of (21) are given by 
K=—0-00035 6} in this case.. The skin friction coefficient for the 
experimental pressure distribution is plotted as the continuous line in fig. 3 
and the dotted line gives Tomotika and Imai’s solution. Somewhat higher 
values for the skin friction are given by ‘thé present method for both 
pressure distributions. Tomotika’s method is a development of - 
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Pohlhausen’s two-dimensional solution which has been compared by 
Bairstow (1939) with more accurate solutions for the flow past a circular 
cylinder. In this two-dimensional problem it is found that Pohlhausen’s 
method underestimates the friction and it is likely that Tomotika’s solution 
for the sphere will involve similar errors. The separation point calculated 
from the experimental pressure distribution is 83-3° from the front 
stagnation point compared with 84° for Tomotika’s solution and the 
experimental value of 83° found by Fage. 


§8. FLtow Past an ELLipsoip. 


The boundary layer flow past an ovary ellipsoid of fineness ratio 3 
is next considered. This example involves much smaller pressure gradients 
than occur on the sphere over the major part of the boundary layer and in 
this respect it is more representative of a practical streamline shape. The 


Fig. 4. 
lad 
eZ 
et 
1-O 
O O-2 OAT oce. OO O'8 


Velocity outside the boundary layer of the ellipsoid. 


experimental pressure distribution has been measured at a Reynolds’ 
number of 2:5 10° based on the major axis, which is 1 ft. long. The 
ellipsoid was mounted in a 4 ft. square wind tunnel and supported by a 
tube 1 in. diameter which formed a rearward extension of the ellipsoid and 
which was faired into it. All pressure connections were carried through 
this tube and the pressures were recorded after carefully aligning the axis 
of the ellipsoid along the stream. In fig. 4, values of h calculated from 
these pressure readings are plotted against x, the distance from the nose 
divided by the major axis. The readings are plotted as far as the laminar 
separation point which is located at ~=0-76. The skin friction calculated 
by the present: method is shown in fig. 5. The solution has not been 
carried beyond z=0-65 as its accuracy is considered doubtful further 
downstream. A difficulty arises in the retarded part of the flow when K 
approaches a value of about 1-4 as the coefficient of dK /dy decreases rapidly 
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and becomes negative for larger values of K. For this reason the method 
fails to give a solution near the laminar separation point in the present 
example. 

§ 9. CONCLUSION. 

Solutions for the laminar boundary layer flow on a solid of revolution are 
described which employ methods previously applied successfully in two- 
dimensional flow. An approximate solution is obtained which is superior in 
accuracy to earlier methods for accelerated boundary layer flow. The 
approximate solution is compared with accurate solutions for the flow past 
cones and satisfactory agreement is found. Comparisons with Tomotika’s 
solutions for the flow past a sphere show that the skin friction calculated 
by the present method is larger over most of the laminar layer. 


Fig. 5. 


Skin friction, for flow past ellipsoid. 
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SUMMARY. 


The theoretical accuracy and certainty with which range may be 
determined by radar is obtained quantitatively by applying the principle 
of inverse probability. In agreement with experience, the theory yields 
two fundamental criteria for satisfactory radar reception. First, the 
total received signal energy must always be larger than the effective noise 
power per unit bandwidth. Secondly, there is a more stringent threshold of 
unambiguous reception which depends not only on the quantity of received 
energy but also on the bandwidth of the transmitted waveform, for as 
this is increased the minimum energy required for the certain detection 
of an echo also increases. The quantity of information obtainable at the 
receiver, as measured by reduction of entropy, is evaluated for comparison 
with that given by Shannon’s general theory of communication (Shannon 
1948). It is found that as the time of observation and hence the received 
energy increases, information is initially obtained at a roughly uniform 
rate which is not far removed from the absolute limit for an ideal 
communication system, but that after crossing the threshold of 
unambiguous reception, additional received energy contributes little 
further information. 

Since only stationary echoing objects are considered, phase-coherent 
integration is implied in the theory and the problem of post-detection 
integration is not discussed. Apart from obviously unsuitable waveforms, 
the theory is applicable to any type of modulation. 


§1. [yrRODUCTION. 
Ir has long been recognized that the amount of information which can 
be obtained from any physical observation is fundamentally limited by 
random fluctuations. In communication problems, for example, the 
physicist or engineer always strives to work with signals which are large 
compared with the background of noise. Superficially, one might be 
tempted to conclude that information can be communicated at a satis- 
factory rate only when the signal power P is greater than the noise power N 
in the system. There is, however, no such abrupt limitation, for Shannon 
(1948, 1949) has shown that the limiting rate at which information can 
be communicated, in the presence of a uniform, spectrum of Gaussian 
noise, is given by 


C=W log, (+5) no ‘a, gag tae Macca Tha abe if 
* Communicated by the Authors. 
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binary choices per unit time, when no frequencies higher than W are 
utilized. Nevertheless, many practical systems do exhibit a threshold of 
intelligibility when P is equal to N, but this is only due to the manner of 
coding the information. It is not revealed by equation (1) because this 
assumes ideal coding, which might be somewhat complicated if P were 
less than N. A threshold is especially pronounced when the information 
is encoded by a non-topological increase in its natural number of 
dimensions (Shannon 1949). 

The simplest information sought from a radar system is the range of — 
an echoing object, presumed stationary in this paper. Such information 
has only a single dimension, but it is coded (perforce) in a special manner 
which increases its dimensionality. A repetitive waveform, not necessarily 
a sequence of pulses, is transmitted, echoed, and received. Observation 
is assumed to last only for a finite time—any integral number of repetition 
periods—during which only a finite quantity of signal energy is available 
at the receiver: the observer compares the transmitted and received 
waveforms and tries to determine the time-delay between them. The 
presence of noise in the received waveform prevents him doing so with 
unlimited precision. Both transmitted and received waveforms occupy 
a substantial area of the time-frequency information diagram as defined 
by Gabor (1946), and therefore encompass a large number of elementary 
cells. In Shannon’s terminology, their dimensionality is high. But since 
the shape of the transmitted waveform is fixed at the outset and contains 
no information at all, and the received waveform is only a delayed copy, 
it is not obvious that this time-frequency area can be utilized to the same 
advantage as in an ordinary communication system. One of the objects 
of this paper is to show that radar can be fairly efficient in this funda- 
mental sense. The primary purpose of the work, however, is to determine 
the threshold of intelligibility, which we might expect to be quite marked 
in view of the nature of the coding. Under conditions of intelligible 
reception, it is further possible to determine the inherent accuracy with 
which target range may be ascertained. 

The mathematical analysis centres round one important concept, the 
a posterior’ probability distribution for the time-delay 7+ between trans- 
mission and reception. This probability distribution describes all that 
can possibly be known about range after a radar observation has been 
made, and is formally derived in §3 after a preliminary statement of 
terminology in §2. The distribution will be found to depend on two 
significant functions of 7, for which the terms signal function and noise 
function are adopted. They are, effectively, the signal auto-correlation 
and signal-noise cross-correlation functions (Wiener 1949). Both of them 
depend on the known shape of the transmitted waveform and on the known 
statistical characteristics of the noise, but apart from this, they depend 
separately on the signal and noise actually received. Their relative 
importance is discussed in §4, where certain of their properties are derived 
for subsequent reference. The so-called signal function shows to what 
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extent the accuracy of a range determination is, in the presence of noise 
inherently limited (§5),-but the limitation it describes is merely the 
“ topological uncertainty’ of the observation, because it refers to a 
connected region of uncertainty in the neighbourhood of the true value 
of the range. The noise function, on the other hand, may produce spurious 
peaks in the probability distribution, representing ‘‘ non-topological 
uncertainty ”’ or ambiguity which may exist as the result of an observation, 
and which may render the observation valueless from a practical point of 
view. Such ambiguity arises from the non-topological mapping referred 
to earlier. The conditions under which ambiguity occurs in a radar 
system are examined in §6, where a formula for the threshold of 
intelligibility is derived—fig. 1 illustrates the effect. Finally, in §7, the 
entropy of the a posteriori distribution is obtained: after subtraction 
from the a priori entropy it represents the information content of the 
observation (fig. 2), and it is interesting to compare this with Shannon’s 
general formula for the information capacity of a communication channel. 
The main features of the results are briefly discussed in the concluding 
section. 


§ 2. REPRESENTATION OF WAVEFORMS. 


The transmitted radio-frequency waveform will be denoted by the 
complex function x(t), and the received waveform by 
AG) cobb (tome ha ck Ay Seg vee eee) 
Upon reception, %(t) will have suffered a change in amplitude and a delay 
of 7 in its passage to and from the target, while v(t) is a complex random 
function representing added noise. These complex functions are all 
formed from the physical waveforms by the addition of an imaginary 
quadrature component in the manner described, for instance, by Gabor 
(1946). Doing the analysis in terms of such complex waveforms introduces 
no spurious information, and it is only necessary to remember that the 
energy which would be dissipated in a unit resistance is no longer the time 
integral of the square of the waveform, but half the integral of the squared 
modulus. As a matter of purely mathematical convenience, it will be 
presumed that the noise energy is confined within a finite band of 
frequencies. As long as this band is sufficiently wide to include the whole 
spectrum of 7, there is no loss of generality, for it is merely equivalent 
to assuming that y(t) has been filtered so as to reduce the noise while 
leaving the signal unaffected. Such curtailing of the noise, which is 
assumed uniformly distributed in frequency, has no bearing on the 
problem of optimum filtering. 
Since y, #, and v are all high-frequency functions, they may be written 
in the form 
| y=yl(t) exp (2mifol), 


p=u(t) exp (2ifot), is wh totes Bio) 
v=n(t) exp (277ifpt), 
4 322 
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where y, w and » are complex low-frequency functions, and f, is the carrier 
frequency defined by 


ite [wr gbae 
Pas 7 ; 


It may be shown (Gabor 1946) that this definition makes f, the centroid 
of the power spectrum of 4. When ; is periodic, it is equivalent to 


fret pjasey 8 On ees 


where the integral is taken over one period and f denotes the instantaneous 
frequency, or rate of change of phase, of 4(¢). The above definitions have 
the merit of applying equally well to any form of modulation, and are 
essential to the subsequent analysis. 

Further, it is convenient to introduce here the vectorial representation 
of waveforms (Shannon 1949, Oswald 1949). Suppose that z(¢) is a 
complex function of t, whose complex frequency spectrum s,(f) is zero 
outside the range of frequencies (— W, W), as is assumed for the particular 
functions y, wu, and n defined above. It is well known that z(t) may be 
completely specified by a set of sample values z(7/2W), and may therefore 
be represented by a complex multi-dimensional vector z with these 
components. The Hermitian product of two such vectors z and w, is 


(4). 


ziw=L2Fw,=2W J 2*(t)w(t) dt. elise ON 


The second equation may be proved by applying Parseval’s theorem 
(Titchmarsh 1939) successively to the Fourier Integrals representing 
2(t) and w(t) and to the Fourier Series representing s,(f') and s,,(f) in the 
interval (—W, W). It follows immediately from equation (6) that 


ulu= |u[P?=2W J] u(t) ? dc=4WE, Macey hohe 349) 


where E denotes the energy of the waveform w, or %, within the range of 
integration. Further, if u’ represents the waveform du/dt, we may define 
an effective bandwidth 8 by the equation 


[wi P= 62] who nln Tene seen 


This bandwidth f is, apart from a constant factor, the effective bandwidth 
as defined by Gabor (1946), who has shown that 62/47? is the second 
moment of the power spectrum of wu about the origin, or % about f. 

Vector notation is especially convenient when dealing with noise 
characteristics, and the results quoted in this section will be used in vector 
and integral form indiscriminately, since each may be transformed into 
the other by equation (6). 


Theory of Radar Information 1005 


§ 3. Bastc THEoRY ofr RADAR OBSERVATION. 


The central problem isto form, from any particular received waveform 
and from a knowledge of the transmitted waveform, the probability 
distribution for the target range, measured in terms of the time-delay r. 
Suppose that, on some typical occasion, the actual value of the time-delay 
is 7) and of the coefficient « is xu); the received waveform is then 


Pesto) bea At WS ow | wt ae ee (8) 


In order to form the probability distribution for + (and in general « also) 
when only y(t) and x(t) are known, one must first determine the probability 
density for the observed y(t) on the assumption of any guessed 7 and «. 
This depends solely on the noise which would be necessary to satisfy the 
assumption. On a hypothesis (7, «), the noise would in fact have to be 


y(t)—oup(t— 7), 


or in terms of the low-frequency functions defined in equation (3), 


{y(t)—au(t—r) exp (—2zifor)} exp (27ifot), . . . - (10) 
The expression in large brackets may be represented by the vector 
Vaca re Ae eee, wee (TL) 


where u(t—7) has become a continuous vector function of 7, and w stands 
for 27f,. The vector (11) uniquely determines the high frequency noise 
fluctuation, and assuming Gaussian noise, its multi-dimensional probability 
density is proportional to 


exp {—|y—au(rje“" P/2N}. . - e+ (12) 


Here N is the mean physical noise power, and is the mean square value 
of both the real and imaginary part of the noise fluctuation at each 
sampling point. All these values are uncorrelated because the noise is 
assumed to extend uniformly over a high-frequency band of width 2W. 

If the a priori probability distribution for 7 and « is denoted by po(r, «), 
then by the principle of inverse probability (Jeffreys 1939) their a posteriort 
distribution is 


p(t, x) “p(t, «) exp {— | y—au(z)e" P/2N}.  . . (13) 


This is the frequency distribution of the true values of 7 and « over an 
ensemble in which the received waveform y(t) is fixed. The @ priori 
distribution, on the other hand, is the corresponding frequency over the 
ensemble of all waveforms received, and must be precisely formulated 
before one can proceed to analyse the problem further. 

Since this paper is concerned with the information content of received 
signals of given energy, it is assumed that the coefficient « is constant, and 
hence that its a priori distribution is given by 


DU) == glee Wa we kk ee (14) 
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Further, it is assumed that the a priori distribution of 7 is uniform over 
a fixed interval T, less than or equal to the repetition period of the 
transmitted waveform. Such an assumption is not only the simplest 
but also the most non-commital one to make, because it represents a 
state of least prior knowledge (Jeffreys 1939) between the limits of the 
distribution. Within the interval T we therefore write 


Polt, 1) = 7:8(0—26). ne tes ee ON 


It follows from equation (13) that the a posteriori distribution for 7 is 
p(t)= | p(t, «) duoc exp {— | y—apu(r)e~” P/2N}, . (16) 
0 


within the a priori interval T, and zero elsewhere. 
This expression will now be reduced to a more convenient form. 
By expanding, and omitting terms independent of 7, we obtain 


p(t) oc exp E Ay ale} |. a in eee 


Of the terms which have disappeared, | y |? is dependent on 7, but not 
on 7, whilst the term | u(z) |? is proportional to the energy of the signal 
over the interval of observation, and is independent of 7 because it extends 
over an integral number of periods. Converting now into integral form, 


p(r) oc exp {zt R | y* (t)u(t— r)e—*" ar} : awe (13) 
where ; 


NWN, ote lee ee 


and N, is the mean noise power per unit bandwidth. 
From equations (3) and (9), substitution for y gives 


] ; 
p(t) cexpF E al {a%u*(t— r9)u(t—r) 


tagnt(ut—nereryat | PRERROTE, hAN} 


Since n(t) is a complex random quantity, the constant exp(—tw7)) may 
be absorbed into it without altering its statistical properties, and is in 
future omitted. Denoting the phase of the integral by 0, and writing 


Cars 


r)= sp | wilt—ro)u(t—r) dt See or 
(x)= se | m*(iu(t—z) dt Slee eee 


one obtains equation (20) in the form 


p(r) «x exp[cos {w(t—7))—9} «| g(7) +h(z) |]. ce een 2) 


Theory of Radar Information 1007 


The cosine is a high-frequency function of +, while @ and also the modulus 
are low-frequency functions. This means that the a posteriori distribution 
consists of a closely-packed sequence of peaks under a slowly varying 
envelope ; these peaks arise from the comparison which the observer 
could make between the carrier phases of transmitted and received 
waveforms. 

The information which this fine structure represents is of no value 
when there is ambiguity between one peak and another; the realistic 
distribution for +, denoted P(r) is consequently given by smoothing p(r) 
to remove fine structure. It is important, however, to realize that, 
although the information which could be obtained by comparing the 
phase of the transmitted and received carriers will be discarded, phase- 
coherence from one repetition cycle of the modulation to the next is 
assumed throughout the interval of observation. In other words, 
variation of phase yields useful information but absolute magnitude does 
not. The smoothed distribution is easily obtained by integrating 
equation (23), with respect to 7, from 7 to t+(2a/w), and assuming that 
the variation of 6 and the modulus are negligible over this interval. This 
is not equivalent to taking the envelope of p(z7), because the fine structure 
is not of a sinusoidal nature. Integration gives, in fact, 


Bee ChG ae th tebe oe ieee Sek) 


where I, is a modified Bessel function, and A is a normalizing constant. 
It should be emphasized that the integrals g and f extend over the whole 
interval of observation. 

The characteristics of P(z) depend markedly on the relative importance 
of g and h, which may be regarded as the contributions of the signal and 
noise to the probability distribution. For convenience, they are termed 
the signal and noise function respectively : they are, in effect, the signal 
auto-correlation and signal-noise cross-correlation functions (Wiener 1949). 


§4. THe SignaL anp NoisE FUuncrrions. 


In this section, some of the properties of g(7) and h(7) will be formulated, 
and there will emerge, incidentally, a criterion for the amount of energy 
which must be received from an echo before satisfactory radar observation 
is possible. 

We should expect P(7) to have a peak in the vicinity of the true range 7, ; 
this will arise from the signal function g(r). Elsewhere, g(r) will in 
practice be negligible because, in effect, radar waveforms are specially 
chosen to ensure it. Near +, one may expand u(f—7r) by Taylor’s theorem, 
and obtain from equation (21) 

%6 


g(r) =< | we(t—re) ult —r9) (779) t= 10) 
+4(r—7))?u"(t—7)} dt, . . ~ (25) 
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neglecting higher powers of r—r) and denoting du/dt by wu’. Writing 
p?==2H/N), Pr rs Cal ee PN, 


where E is now the energy of the received signal «xs, one may obtain from 
equations (7), (4) and (8) the following useful relations : 


x | ul nyult=1{) di = polle (oan 
No 

a2 

a | u* (t= 7,)\th (t—7)) dt =O, en) 
No 

Sf uttt—r )u” (t— T,) dt= — p?B?. 1 a ae Te 2) 
No » 


These are definite integrals, extending over an integral number of repetition 
periods of u(t). .The second relation depends on the definition (4) of 
carrier frequency, while the third is obtained after an integration by 
parts. Consequently, 


g(r) =p— 407 — To + O(7 79)? es (80) 


It will be seen that the peak in g(z) is of height p? occurring at 7), and in 
this neighbourhood g(r) is purely real. This quantity p? is important 
because it is the dimensionless energy parameter which constantly recurs 
in the theory. The expansion of g(r) outlined above is required later, 
when determining the accuracy of range measurement. 

It is now necessary to consider the noise function f(z) and its influence 
on g(t). Transformation of equation (22) into vector notation gives 


h(r)= Sentu(r). = ee eee aaa 


This Hermitian product is a linear function of the components of n, and 
the resultant is a stationary random function whose distributions in real 
and imaginary parts are independent and Gaussian about zero. By 


using the additive property of mean square deviations, one may readily 
show that 


{Bh 7) P= {Fh(7)P—=o2 | u(r) P/N=p2 . . . . (32) 


Furthermore, it is well known (Rice 1945) that the amplitude distribution 
for such a function is given by 


1 
OS a eexp (spi yaee atiainaes oal(3s) 


and that || has a mean square value 2p?. 

When E<Nbo, the root mean squared modulus, p/2, of A(z) is greater 
than the peak value p? of g(z), and the signal and noise functions will not 
be distinct in P(r). A necessary, though insufficient, criterion for satis- 
factory observation is therefore that the total received energy should be 
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greater than the noise power per unit bandwidth, and for convenience in 
the mathematics it will be assumed much greater. The condition 


Rae 2 ee OLR cet es apne (OS) 


applies, in fact, throughout the remainder of this paper. It is insufficient 
because it does not of itself ensure that the area under P(z) is all near T)- 

An immediate consequence of this condition is that the peak of the 
signal function g(7) will be large compared with unity, and it can be seen 
from equation (24) that this feature is accentuated in P(z7), since 
asymptotically I(x) increases exponentially with ~. Thus, for practical 
radar waveforms, the peak in the normalized probability distribution 
due to the signal function will be concentrated near 7, and in equation (30) 
the higher powers of (r—7,)) may be neglected. Except near this peak, 
the signal function will have a negligible influence on P(z) and the noise 
function (if it is of sufficient magnitude) will alone affect it. The total 
interval T over which P(r) is defined will thus be considered in two parts : 
one which includes the peak of P(r) due to the signal function, where 
P(r) will be denoted by P,(7), and the remainder, of a length nearly 
equal to T, where P(r) will be denoted by P,,(7) since g makes a negligible 
contribution. It is possible to demarcate the two regions satisfactorily 
because P,(7) itself consists (when p? is large) of a sequence of isolated 
and randomly occurring peaks. The first region will determine the 
accuracy with which range can be measured, whilst the area under the 
second region represents the degree of ambiguity of the observation. It is 
the probability that, in the ensemble of observations with fixed received 
waveform, the actual range does not lie anywhere near 7). 

The more important features of the signal and noise functions have now 
been enunciated, and it has been seen that, if the total received energy 
is not large compared with the noise power per unit bandwidth, only a 
negligible quantity of information can be obtained from the system. 
This result is not, of course, confined to radar and may be seen quite 
generally from Shannon’s formula, equation (1). 


§5. Rance ACCURACY. 


There are two distinct statistical approaches to the problem of observa- 
tional error. The first (and here deceptively simple) approach is to fix 
the true value and evaluate the spread of the estimates. The other, which 
has been adopted in this paper, is to fix on a typical received waveform 
resulting from a true range 7), and to consider the spread of the true 
values + which could have produced it. The latter approach, involving 
the inverse distribution P(7), eliminates consideration of how the observer 
estimates the true value when confronted with the received waveform. 

Since P(r) represents the properties of an ensemble in which the received 
waveform y is fixed, it is particular, inasmuch as it depends on the specific 
choice of y. It will therefore be necessary, on occasion, to consider 
average statistical properties of P(r) by varying y while keeping 7» fixed 
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(see equation (9)). Averages over this fixed-r, ensemble will be 
distinguished from averages in the fixed-y ensemble by using the symbol 
Av instead of a bar. We shall then demonstrate that P,(7) is approxi- — 
mately Gaussian near 7, and calculate its standard deviation. 

For the present, any ambiguity arising from the behaviour of h(7) 
outside the g-region around 7, is ignored, but its influence inside it is 
interesting and is considered as follows. By equation (31) and the 
Taylor expansion of u(t—r), the noise function near 7» is 


h(r)= 2 {ntu(r9)—(r— 79)’ (79) + O(7—79)"}- > + (35) 


It may be shown that, if p? is sufficiently large, neglect of all but the 
linear and constant terms is not serious in spite of the necessity for 
including the quadratic term in the corresponding expansion of g(z). 
Thus we may write 


hr)=E+in= {Eot(t—7)E3+t{yo+(t—T)n'}, - + (36) 


where &, yo, €’ and 7’ are random in the fixed-7, ensemble and have 
Gaussian distributions about zero. From equation (32), 


Av(2)=Avlng)=p, - . eee (87) 
and by a similar argument, it is easy to show that 
Aol e'*)=2A0(s*)= peBee ree 


Thus |g-+-| may be approximately evaluated in the neighbourhood of 7, 
for substitution into equation (24). Expanding the modulus and 
neglecting powers of (r—7,) higher than the second, one obtains from 
equations (30) and (36), 


1 aria As cae 1 
| 9(7) A(z) | =p? + fo+ Xp2 ne 2p°B sp («19+ =m) +0 (5) . 
MOA Res5) 
This may be written more simply as 
WG We Vert a eer Veena AN 
where from equations (37) and (38) 
BOL n) sate us: slic: rn. o 
NOT STDs 0 IAS Vl a ee (42) 
A(T m— 7 )?=1/p?B?. mee eal rm ms it''s) 


Near 79, one therefore has from equation (24) 


Pol =Aly{e? x —hp Br — 77}. (4s) 


The asymptotic expansion for I, is 


w 1 1 
Ts)~ Foes + ge +O(z)}, Mobisieninitne’s 5009 


Theory of Radar Information 1011 


and a trivial application of it shows P,(r) to be approximately Gaussian 
near tT) with 


ToT ee ees BIT ere eae AED. 146) 


(ga) = lip *B2, Ke Sra onc Leow atin er (47) 


Thus the standard deviation of + from its mean in P g(7) 18 given by 


d7=1/p8, Ee «eee nc. a eee (48) 


and this represents the a posteriori uncertainty in the measurement of 
range. That the mean value of 7 in P(r) has itself the same uncertainty 
with respect to 7) (equation (43)) is no. mere coincidence, and is what 
one would expect from an application of inverse probability: both 
spreads really represent the same effect. 

Equation (48) is the first result of practical interest. The uncertainty 
is independent of the frequency response of the receiver because it has 
been established without reference to any particular receiving mechanism 
and represents optimum performance. 


§6. AMBIGUITY. 


It was remarked at the end of §4 that the degree of ambiguity of an 
- observation is the probability that, in the ensemble of observations with 
fixed received waveform, the actual range 7 does not lie anywhere near 7p. 
The ambiguity, A, may therefore be expressed as the area under that 
portion of P(r) which is due only to A(z), 2. e. which excludes the signal 
peak discussed in the previous section. This will be calculated by 
multiplying the expected area per unit range of 7 due to h alone, by the 
whole of the a priori interval T. This in fact includes a small range of 
integration near +, which should strictly be omitted, but since for any 
practical system one has the inequality 


Fess (ee mm ir at oe es ie (40) 


this approximation is justified. 
It is convenient to define P,(7) precisely, by the relation 


Pea i Uli aero ue sth. Leay ns (00) 


and this is identical with P(r) defined in equation (24), except near 7». 
The “expected area per unit range’? under P,(r) is, of course, the 
expected value of P,, and may be obtained by considering the probability 
distribution for |/]| given by equation (33). Thus 


BoP P= | POP Pee (51) 
0 


where 
Q(P,)dP,=4( |r| )d| AI. eA vis). ane (52) 
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The two forms of average in (51) are equal because h(r) is a stationary 
random function. The integral may be evaluated by substituting for ¢ 
from equation (33), P,, from equation (50), and using the expansion (45) 
for the Bessel function. Finally, one obtains 


P,=Ae*?(1+-O(p-4)), Me raast es ee, 
and the ambiguity, given by the average area under P,,(7) is 
AZ TP OD... Pica ee a ee 


It only remains to determine the normalizing constant, A, of P(r), which 
depends also on the area under P,(7) due to the signal function. 

This area is obtained by integrating the distribution given by 
equation (44), and using again the Bessel expansion. The result is* 


. Aeg*+1 1 1 ; 
[ Palrdr=“S-{1— 55 +0 (=) }. i Coma) 
Fig. 1. 
AMBIGUITY. 
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The threshold of intelligibility. 


Neglecting terms ef order p~* compared with unity, the normalizing 
equation is 


ee ih P(r) dr-+ J Par) dr 


etl il . 
=m (1 ga) + Bet ee oearet 


Hence, approximately, 


T 52 
A= ee ° . . e (57) 
p’B+ exp ($p?-++1) 

a 
* An average has been adopted in the exponent—see equation (41)—as a 
matter of expedience. The consequences in later parts of the theory are 
unimportant except when A~0-5, where additional approximations are, in 

any case, unavoidable. : 
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Thus, keeping T and 8 constant, as the ratio 4p? of the received signal 
energy to the noise power per unit bandwidth increases, the ambiguity 
diminishes to zero from a value near unity. The transition is fairly 
abrupt as shown in fig. 1, and forms the threshold of intelligibility. This 
may be defined conventionally by A=0-5, and is therefore given by 


bp?+l=logTp’B. . .. .. . . (58) 


When $p?+1 Slog Tp?8, unambiguous reception is ensured. This is the 
second criterion for a useful radar observation. 


§7. INFORMATION. 

The quantity of information obtainable from a radar observation is 
the difference of the entropies (Shannon 1948) of the a priori and 
a posteriori probability distributions for range. Since the a priori 
distribution has been assumed rectangular of width T, its entropy, 
defined by * 

Heat tp loop dre peeseeeer eek (BO) 
is given by 

He=loo le Ce) eae ee ea es (OU) 
The a posteriori entropy required is the average entropy of P(z) over the 
total ensemble of received waveforms. Since, however, the entropy 
is independent of 7, it is sufficient to consider an average over the fixed-7, 
ensemble. As before, that portion of the distribution near 79, viz. P,(7), 
and the remainder, P,,(7), will be considered separately, the total entropy 
being given by addition. 

A difficulty arises, however, in evaluating the entropy of any particular 
distribution P(7) because, in doing so, the correct normalizing constant 
for that distribution should be used, and this will vary throughout the 
fixed-7, ensemble. This makes the integrations difficult to perform, and 
to simplify the treatment a fixed normalizing constant A, dependent on 
average properties of P(v) and given by equation (56), is used. 

Fluctuations in A are due, when p*>1, almost entirely to P,(7) and 
their neglect is justified in either of two circumstances : 

(a) When the total area under P,,(7) is small, 7. e. A~0, and the entropy 
is approximately that of P(r) alone, or 

(b) When the total interval, T, over which P(r) extends, is sufficiently 
large for P, to include an adequate statistical sample of those of its 
ensemble properties which influence the entropy. This condition amounts 
to A~1, though the derivation is lengthy and is here omitted. The 
entropy then arises almost entirely from P,(7). 

Thus one may first calculate the average entropy due to P,(7) alone, 
which will represent that of P(r) when A~0; then that due to P,(7) 
alone, giving the entropy of P(r) when A~1; finally, an approximation 
in the intervening (threshold) region will be made by ignoring fluctuations 
of the normalizing constant. 


* Here p(z) signifies any probability distribution. 
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The contribution of P,(r) to the entropy, is obtained by substituting 
the averaged expression for P(r) given by equations (44) and (41) into 
equation (59), using the expansion for I), and integrating. This leads to 


eye Be | eatin o(=)}. ‘261 


But if A~0, from equation (55) the normalizing constant is given by 


dee t1 1 
——— | be.) ee ee 
pp ( 53) 
whence from equation (61) 
a/ (27) (5) 
AvH ,= log —.— +0(-}. vl .cet bowen enka? 
UA vag 28 “wep writing 


The information content is given by the difference of the a priort and 
a posteriori entropies, viz. 
Il=Hj—Hyi=loo T—Hy 2 ee 


and hence 


) T 
I~ log oe, (A~0)+n42 4, shee ea 


Since P,(7) is a stationary random function, one may calculate its 
average contribution to the total entropy by multiplying the expected 
value of P, log P, by the interval T of integration. Thus, ignoring 
fluctuations of A, 


AvH,=—T | Q(P,)P, log P,dPy = =. ws (68) 
0 


Hence, substituting for Q(P,,) and P,, from equations (52), (33) and (50), 
using the expansion for I), and integrating, one. obtains 
A 1 
AvH ,=—ATe?? {lo 244310 (5) \ : iy (GR 
h 5 oy (am) oP. eae p2 ( ) 


But when A~1, using equations (53) and (54), the normalizing constant A 
is given approximately by 


Act LAT Ls a en ee 
Thus, 
: 1 
AvH = log (Tpy/(2n))—3p*—4+0 @ SNK) 
and 
I~ dp? log (pV/(2n))+4, (ANI)... . (10) 


Equations (65) and (70) thus give the asymptotic approximations to the 
information content under conditions of unambiguous and ambiguous 
reception, except in the vicinity of the threshold at A—0-5. 
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When A approximates neither to zero nor unity, it is, of course, 
necessary to normalize P(r) taking into account both P,(z7) and P,(7), 
and incidentally to ignore—as explained above—normalizing fluctuations 
which should in this region be included. The value of \ taken from 
equation (56) is substituted into equations (61) and (67), addition of 
which gives the approximate average a posteriori entropy H,. Finally 
subtraction from H, gives 


I~— log { V(@x0) (pent Ts) i +0(4) AWAD <tpeaaee (TL) 


It will be seen that this expression reduces, as it must, to equations (65) 
or (70) according as the second or first terms in the logarithm predominate, 
t.e. A~O0 or 1. It represents a better approximation to the total quantity 
of information than either of the previous equations, even though 
fluctuations of the normalizing constant are ignored. 


Fig. 2. 
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§8. Discussion. 


The chief results of the foregoing theory are illustrated in fig. 2, where 
contours of information content I, given by equation (71), and ambiguity A, 
given by equation (57), are plotted against the quantities log,(T) and 
B/N,. It will be seen that the I-contours behave in a markedly 
different manner on either side of the threshold of intelligibility (A=0-5). 
When the ambiguity A is small, the I-contours are asymptotic to dotted 
curves given by equation (65). The information here represents almost 
unambiguous knowledge of target range, subject to a ‘ topological ” 
error which is shown in §5 to have a standard deviation of 


d7=1/p8, 


here 
p= (2E/No) 
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and f is the “ effective bandwidth ” of the transmitted waveform. As 
would be expected, the error diminishes as the total received energy E 
increases, if the noise power N, per unit bandwidth is constant. It also 
diminishes as 8 increases, but increasing 8 alone in order to obtain high 
resolution, has its drawbacks because the observation gradually becomes 
more ambiguous. 

When the ambiguity is greater than a half, the I-contours are 
asymptotic to the dotted vertical lines given by equation (70), which 
represent the upper bound to the information content when the total 
received energy is fixed. In Shannon’s general theory of communication 
(Shannon 1948) there is a similar upper bound, with which comparison 
may be made. Quite generally, the quantity of information I which can 
be extracted from any received message of energy E is limited to E/No 
natural units. This result is obtained from equation (1) by writing 


I=Cit=We log ( ) nat. units, 


see 
T Nowe 

where ¢ is the duration of the message. As Wé+oo, I approaches the 

asymptotic value 

1,=B/Ny= 3p. 


Ss 


The corresponding radar limit, as6—> oo, is less than this by log (py/(27)) —4 
natural units of information, and the discrepancy may be shown to 
correspond to the fine-structure information which was deliberately 
discarded in passing from equation (23) to (24). If radar could operate 
without a high-frequency carrier, this effect would not arise and the 
asymptotes in the ambiguous region of fig. 2 would exactly equal 
Shannon’s theoretical limit. . 

In normal reception, the quantities 8B, No, and the a priori interval T 
are constant, while E increases with the time of observation. One 
therefore starts beyond the left of the diagram and moves steadily to 
the right along a horizontal path. At first, the amount of information 
increases rapidly as the ambiguity is steadily removed. After crossing 
the threshold into the useful region, however, the information increases 
only very slowly and represents a progressive improvement in accuracy. 
This phenomenon is in full accord with the ideas underlying information 
theory. Once the approximate position of the target is known without 
ambiguity, further incoming energy is continually contributing information 
which is already partly known ; the resolution in range is increased by 
continued emphasis rather than by a systematic process of elimination. 
For this reason, the most efficient radar system should work as close 
to the threshold of intelligibility as possible—how close depending upon 
the degree of ambiguity which can be tolerated. 

It must be emphasized that a phase-coherent received waveform is 
assumed in this paper, and this severely limits the practical application 
of the theory. Ifthe range parameter 7 were itself a function of time, the 
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time-integrals in the theory would have to be evaluated differently. 
As matters stand, the total period of observation must be restricted to 
an interval which is sufficiently short for 7 to be treated as a constant. 
Under these circumstances, there is nothing to choose (except on practical 
grounds) between concentrating the available energy into one repetition 
period of the modulation or spreading it out over several. There is little 
doubt, however, that the latter course is wasteful of energy when long-term 
phase-coherent integration cannot be utilized (Lawson and Uhlenbeck 
1950). 

The bandwidth of the receiver does not appear in the results of the 
theory. As long as it is sufficiently wide to include the whole of the 
transmitted spectrum, no information is gained or lost by increasing it, 
and none can be gained by reducing it. The choice of frequency response 
at the receiver depends only on the best way to extract, and present to the 
observer, all the latent information in the incoming waveform, with 
which practical problem this paper is not concerned. 
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SUMMARY. 


In an earlier work (Tolansky 1944), a description was given of the use 
of multiple-beam interferometry to determine relative phase changes on 
reflection at the silvered surfaces of an interferometer. This method 
has been extended and critically examined. It is shown that the method 
does not approach the accuracy of the older polarimetric methods, and 
that its optimum use is restricted to reflecting surfaces of fairly high 
reflection coefficient. Within its limitations the method has advantages. 


Measurements of the relative phase changes on reflection at a number 


of layers of silver, copper, and tin have been made, together with measure- 
ments of the ellipticity of the light reflected at these layers. The variation 
of these quantities with the thickness of some layers of silver has been 
discussed in relation to the use of evaporated layers for the measurement 
of the optical constants of.bulk materials, and the optical constants of 
evaporated layers of copper and tin have been determined and the results 
discussed. . 


Part |: THe INTERFEROMETRIC TECHNIQUE FOR THE MBASUREMENT 
OF RELATIVE PHASE CHANGES. 


In 1944 Tolansky described the appearance of the multiple-beam 
interference fringes formed by a highly silvered lens and plate combination, 
when the angle of incidence of monochromatic light falling on to the 
interferometer varied from 0° to 80°. At normal incidence the fringes 
took the form of rings (Newton’s rings), but as the angle of incidence 
increased these of course became elliptical, but further each became a 
doublet, the separation of the doublet components increasing with the 
angle of incidence. The components were found to be polarized in 
mutually perpendicular directions and the origin of the doubling was 
shown to be the effective difference in gap of the interferometer for the 
two components, due to the difference between the phase changes suffered 
on reflection by the light polarized parallel and perpendicularly to the 
plane of the incidence. ‘ 


* Communicated by Professor S. Tolansky. 


on cone Ss erate 
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It was apparent that this was a new method for the measurement of 
the relative phase changes occurring on reflection, and Tolansky measured 
this quantity for the particular silvered surfaces of his interferometer. 
No further examination of this method has been reported and it is the 
purpose of this paper to study the method critically both theoretically 
and in the light of a number of measurements of relative phase changes 
made by this technique. 


The Theory of the Method. 


Although Tolansky’s original experiments were carried out on silver- 
modified Newton’s rings it is advantageous to treat the simpler case of 
straight line fringes formed in a. wedge interferometer composed of two 
_ plane reflecting surfaces, inclined at a small angle. 

Consider first monochromatic light, of wavelength A, incident at angle 6 
on a wedge interferometer with an air gap. The fringes (of equal thickness) 
will take the form of lines parallel to the line of intersection of the two 
reflecting surfaces. If the angle of incidence is not zero, each fringe will 
in pots have two components, and if the fringes occur at distances 2,, x 


«,, &,, etc. along the wedge, it can be shown that the relative phase change 4 
can be determined from 
WANG A ae Mata Oa NE pan) at a, oR. Tey eet ok) 


An equation from which 4 may be determined can also be derived in 
the case of fringes of equal chromatic order (Tolansky 1945). These 
fringes are formed when an interferometer ,is illuminated with white 
light, and a small section of the interferometer is projected on to the slit 
of a spectrograph. The section is small enough for the value of ¢ to be 
assumed constant over it, if the wedge apex is parallel to the spectrograph 
slit. In the focal plane of the spectrograph fringes appear, which in this 
case take the form of straight lines parallel to the slit. Again, if @ is not 
zero, the fringes are doubled, the components having different wavelengths. 

Let the wavelengths of the components of two adjacent doublets be 

Ne het N= Then the relative phase changes given by 


‘s? “\p? 
J OnE ered Wie coolialine ie Fae Mae AE) 


and hence J can be found from measurements of A, and A, if m, the order 
of the fringe, is known. 
The order, m, is given by 


MA,»—(m+1)=2,(8, Ap) 2, (0,4)... (3) 


where the function ®, represents the change of phase suffered on reflection 
by the parallel ce nee In the absence of published data describing 
the variation of the function with wavelength and with angle of incidence, 
the order of the fringe m cannot be determined without making an 
assumption of the form of ®,. This is true not only for fringes at other 
than normal incidence but also for all uses of fringes of equal chromatic 


order, 
ee 
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If we assume 2@,(0, A)=8A where f is a constant fraction, then as, 
closely enough, 
Np ps) ee 
gives M—Pp=A [(ARAjes 0 ol oe ie ee 


Now m, the order of the fringe, is integral, and the quantity m—f, 
which can be determined from measurements of the mean wavelengths A 
and 2’, is in general not integral. Moreover, the sign of the fractional 
part is indeterminate. 

However, from measurements on fringes of equal thickness we can find 
the relative phase change 4 for one wavelength unambiguously from 
equation (1), and so if the value of m for a fringe of equal chromatic order 
at that wavelength be chosen to give the correct value for 4, the values 
for the orders of the other fringes of equal chromatic order at different 
wavelengths can readily be assigned, and the fringes used to measure the 
variation of the relative phase change with wavelength. In addition 
fringes of equal chromatic order, being formed in a very small section, 
of the interferometer of assumed constant thickness, have a constant 
dispersion leading to improved accuracy of measurement. As against 
this latter point however, it should be noted that when fringes of equal 
thickness are used, the whole surface of the interferometer is covered, 
with perhaps a desirable averaging effect of the properties of the films. 

The experimental arrangement and the interferometric technique have 
been adequately described elsewhere (Tolansky 1948) and no treatment 
will be given here. The discussion which follows of the accuracy of 
measurement of relative phase changes from measurements of the doubling 
-of interference fringes of non-normal incidence will assume that the inter- 
ferometer has been used in such a manner as to give the best interference 
fringes obtainable with the particular reflecting surfaces employed. 


The Errors in Measurement. 


In figs. 1 (a), (b) and (c) (Pls. XXXII. & XXXIII.) are shown 
photographs of fringes formed at varying angles of incidence in 
interferometers whose surfaces had reflection coefficients 90, 50, and 20 
per cent respectively. The photographs are all of fringes of equal 
thickness, using light of wavelength 5461 A. 

The previously established increase in separation of the fringe doublet 
as the angle of incidence increases from zero is clearly shown. 

Three striking features of these fringes are immediately apparent, 
all of which are of importance in assessing the accuracy with which the 
doubling can be measured, and the relative phase change determined. 

First, comparison of figs. 1 (a), (b) and (c) (Pls. XXXII. & XXXTIT.) 
shows that the width of both components of the fringe doublets increases 
rapidly with decreasing reflection coefficient. This of course is to be 
expected and constitutes one serious drawback to the generalized use of 
the technique, since the highest accuracy is only obtainable when measuring 
the effect on surfaces of reflection coefficients of over 85 per cent, 


Properties of Evaporated Layers of Silver, Copper and Tin 1021 


Secondly, as the angle of incidence increases there is a considerable 
difference in fringe width for the two components for all values of the 
reflection coefficient, and thirdly, the maximum intensity of the sharper 
component decreases rapidly at the higher angle of incidence. 

Tolansky suggested that the difference in the intensities was evidence 
for the existence of some preferential absorption by the silver film. 
However, reference to fig. 2, which shows the variation with incidence 
angle of the reflection and transmission coefficients of a typical silver film, 


Fig. 2. 
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The variation with incidence angle of the reflection and transmission coefficients 
of a typical silver film. 


shows that for these films, whilst some slight preferential absorption 
exists at the higher angles, it is in the opposite sense to the effect observed. 
The difference in the maximum intensities of the two components arises 
mainly from the difference in the transmission coefficients in the two 
cases, the maximum intensity being governed by the ratio 1/1+(A/T)? 
where A is the absorption given by A=1—(R+T). The values shown 
in fig. 2 are typical for silver films of high reflection coefficient. The 
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differences in fringe width are of course due to the widely differing values 
of the reflection coefficient for the components. This effect increases with 
decreasing normal incidence reflection coefficient. ; 

The accuracy with which the doublet separation can be measured is 
affected by the differences in width and intensity of the components in two 
ways. Not only is it impaired by the increase in fringe width due to the 
lower reflection coefficient of the “‘ parallel > component, but also this 
width, coupled with the weaker intensity of the “ perpendicular _com- 
ponent, causes the visibility of the latter to be poor. This obscuring of 
the weaker coniponent is not serious for high reflection coefficient films, 
but becomes so when the reflection decreases to values of 50 per cent or 
less (figs L(¢); Pk XOX XIIT.): 

The errors all increase with decreasing reflection coefficients. With 
fringes of equal thickness and fringes of equal chromatic order, in trans- 
mission, determinations of relative phase change have been made for 
surfaces of reflection coefficient varying between 20 and 94 per cent. 
The experimental error in the determination of relative change of phase, 
expressed as a fraction of the wavelength of the incident light, varies 
from +0-:002A for the optimum case of R>80 per cent, to +0-005 2 
for R<30 per cent. If the relative phase change is expressed as an angle 
this corresponds to +0-7° for the optimum, and +2° for the worst case 
studied. 


The Limitations of the Technique. 

The accuracy obtainable with this method for measuring relative phase 
changes (--0-7°) is inferior to the figures of +0-001 A=0-36° readily 
obtainable by Drude’s method (Hass, 1938) and with the error of +0-1° 
in the estimation of the angle of principal incidence by O’Bryan (1936). 

From the point of view of the use of the measurements for the determina- 
tion of optical constants, however, the higher accuracies obtainable in the 
older optical methods are not always required, since large local irregu- 
larities often arise in the preparation of the reflecting surfaces employed. 
Clearly under such conditions high intrinsic accuracy of measurement is 
meaningless physically. It was to overcome these errors that the use of 
evaporated layers as characteristic reflecting surfaces of metals was 
introduced (O’ Bryan, Hass, loc. cit.). However, this technique does not allow 
of one main advantage of the use of evaporated layers to be gained, namely 
the measurement of the properties of such layers in vacuo. Possible 
deterioration of the layers by atmospheric corrosion during measurement 
could be avoided by the introduction of an oil between the interferometer 
plates. 

However, a principal limitation of the method remains. Relative 
phase changes occurring on reflection at low reflectivity metals such as 
copper, tin, chromium, etc. can only be measured with half the accuracy 
available for measurements on silver and aluminium, due to the increase 
of the errors with decreasing reflection coefficient. 
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As against these limitations must be set some advantages | over the 
earlier method. The method requires no expensive polarimetric equipment 
and the experimental arrangement is simple. The doubling of the fringes 
can be photographically recorded and measurements made at leisure, as 
opposed to the need for laborious visual observations necessary with most 
of the polarimetric methods. This is particularly the case when it is 
desired to measure the variation of the relative change of phase with 
wavelength, since the whole visible range can be covered with .a single 
photograph of fringes of equal chromatic order, whereas a dozen or more 
visual observations with a half-shade device would be necessary with the 
older methods. . 


Part IT: THe Optica, MeasuREMENTS. 

The interferometric data are coupled with measurements of the 
ellipticity of the light reflected at the layers, made with a simple 
photoelectric photometer. An investigation was made as to whether 
the magnitude of the effects occurring on reflection at evaporated layers 
having the finite transmission required for the multple-beam method were 
the same as those associated with reflection at the bulk metal. The 
variation of the phase and intensity quantities with the thickness of the 
evaporated layers has been measured to supplement existing information 
regarding the normal incidence reflection characteristics of evaporated 
layers. 

Measurements have been made for a number of evaporated layers of 
silver of varying thickness and the results served to define the limits 
within which the thickness of a metallic layer must lie if it is to be used 
for the determination of the optical constants of the metal. With the 
increasing use of evaporated layers in this connection (e. g. O'Bryan 1936, 
Hass 1938, Archard 1949) these limits are a matter of some significance, 
particularly when the interference method is used for the Bene na of 
the relative phase changes. 

The layers of silver, and of the other metals used, were all produced 
by vacuum evaporation from molybdenum filaments at pressures less 
than 5xX10->mm. of ‘Hg, after the glass disks on to which they were 
deposited had been cleaned both by hand and by ionic bombardment 
in the evaporation chamber. The rates of evaporation were such that 
opaque layers of the metals could be formed in thirty seconds. 


Measurement of Relative Phase Changes and Hllipticities for Reflection at 
Silver Layers. 

The first series of silver layers produced were all of the order of thickness 
used for usual interferometric work, i.e. between 400-500 A. thick and _ 
having transmission coe‘ficients of up to 4 or 5 per cent. The variation 
of the ellipticity p with the angle of incidence of light for one of these 

‘layers is shown in fig. 3, and the variation of the relative phase changes 
with incidence angle for several layers in fig. 4. The error in the measure- 
ment of the ellipticity throughout the work was -+-0-01, the error in the 
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measurement of the relative phase change, 4, varied with the reflection 
coefficient of the layers, and for the present series was =-0-002 A. The 
measurements of ellipticity were made for the light passed by a filter giving 
a mean wavelength of 5400 A. and a band width of roughly 300 A., and 
of relative phase change for the mercury green line 5461 A. 

Taking the values found by Hass (1938) for the optical constants of 
silver (n=0-177, k=18-60, A=5461 A.) the ellipticities and relative phase 
changes. to be expected on reflection at a silver surface have been calculated 
using formule due to Drude (1890). These theoretical values have been 


Fig. 3. 
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plotted in figs. 3 and 4, and in addition, the angle of principal incidence 
for silver (for which the relative phase change on reflection is equal to 
A/2 or 0-25) found by O’Bryan (1936) is also shown in fig. 4. 

It is seen from these curves that whilst the ellipticity for a layer having 
a transmission coefficient of 0-04 differs from the theoretical value for 
reflection at bulk silver by nearly one per cent at the higher angles of 
incidence, the relative phase change values, taken for some ten layers 
having transmission coefficients between 0-01 and 0-07 do not differ from 
the theoretical value to this extent. This indicates that the value of the 
ellipticity of the light reflected from evaporated layers is more sensitive 
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to film thickness than is the relative change of phase on reflection. This 
will be borne out completely when the measurement of the variation of 
these quantities with thickness is considered. 

In fig. 4 are plotted the earlier measurements made by Tolansky (1944) 
for the relative phase changes occurring on reflection with a given silver 
layer. In the neighbourhood of 65° incidence, there is a sudden change 
in the slope of Tolansky’s curve. This effect has not been observed with 
any of the films examined in this work, and has not been completely 
explained. One possible explanation is that the films used by Tolansky 
were comparatively old (at least one month after preparation) and the 
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The variation of the relative phase change on reflection at “thick” silver 
films with the angle of incidence. 


effect may be due to surface contamination of the layer due to age. In 
the present work measurements were made on the layers within one hour 
of production in each case. Below 65° there is reasonably close agreement 
between the two sets of data. 

The variations of the quantities p and 4 with the thickness of the silver 
layer varying between 100 A. and 500 A. are shown in figs. 5 and 6 for 
an angle of incidence of 65°. The errors in the values for p are +-0-01, 
but the errors in the relative phase changes increase from --0-002 for 
the thickness layers, to +-0-005 A for the thinnest ones, due to the decreasing 
reflection coefficient of the layers. The thicknesses of the layers were 
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measured by the interferometric technique developed by Khamsavi and 
Donaldson (1947), and the values are accurate to +15 A. The mass of 
silver present per unit area of the film is also given, since the thickness 
measurements may be open to question (Avery 1949). 
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The variation of relative phase change with thickness for silver films. 


The theoretical values of p and 4 were calculated using formule 
similar to those developed by Winterbottom (1948). 

Apart from a slight divergence in the thickness region about 250 A., 
the values of the ellipticity measured agree with the theoretical values 
down to layers of about 150 A. in thickness, and marked differences 
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between theory and experiment first appear at thicknesses below this 
value. This agrees well with the variation of the normal incidence 
coefficients with thickness as measured by a number of workers (Rouard 
1949), and the deviation of experiment from theory at the lower thickness 
values is in accord with the observed changes in the optical constants 
of silver layers of lesser thickness. The relative phase changes (fig. 6) 
show no similar divergence, though the measurement of d for the thinner 
films is correct only to 5 per cent. 

If reflection at an evaporated layer is treated as equivalent to reflection 
at the metal in bulk when the values of p and 4 do not differ from the 
values characteristic of the bulk metal by more than one per cent, then 
fig. 6 shows that as far as the measurement of relative phase change. is 
concerned, the layers need be no more than 350 A. thick for this. For 
silver this corresponds to a transmission coefficient of 6 to 8 per cent. 
Such a thickness gives bright good multiple-beam interference fringes. 
Since the deviation of the value of 4 from the bulk metal value as the 
thickness decreases is probably due to interference effects within the 
layer, this implies that the optical transmission of the layer is the deciding 
factor of the usefulness of the film for ‘“‘ bulk’ measurements, not the 
physical thickness. 

Fig. 5 shows that the’ variation of the ellipticity with thickness is 
more rapid, and deviations of one per cent from the bulk metal value are 
found for thicknesses below 450 A., that is for silver, for transmissions 
of no more than one or two per cent. Whilst for silver this represents 
no hindrance to the use of the same layers for the measurement of both 
the quantities p and 4, metals which form layers of a greater absorption 
than silver layers need a greater transmission than this if reasonably 
intense interference fringes are to be obtained. . 

If a simple photometer to measure p, combined with’ an interference 
technique to measure 4, is used to determine optical constants, or for 
other reflection studies, it seems that it is most suitable to prepare two 
sets of layers. One set should be opaque or very nearly so, and used for 
the measurement of the ellipticity, and the other set should have trans- 
missions of the order of five per cent, and used in the interferometer to 
measure the relative phase changes. Where a polarimetric method is 
being used to determine p and J in one operation and the layers are 
produced and measured immediately 7m vacuo, the layers must have 
transmission of less than one or two per cent for the layer characteristics 
to be within one per cent of those of the bulk metal. . 

Electron microscope investigations have shown (Levinstein 1949) that 
not all metals can be deposited on to substrates at room temperature to 
give layers with structures corresponding to the bulk metal. Thus the 
optical properties of evaporated layers, particularly those of the low 
melting point metals, may differ from those of the bulk metal due to 
differences in crystalline structure, even in layers thick enough for the 
interference effects within the films to be neglected. 
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The Optical Constants of some Hvaporated Layers of COP R Gs 

Copper films were deposited by evaporation, four layers being ee 
almost opaque and four with transmissions of up to five per cent. The 
values of the ellipticity found from the thick layers are plotted against 
incidence angles in fig. 7 and the values of relative phase change found 
interferometrically from the slightly transparent layers are shown in 
fig. 8. The errors in 4 are 0-003, due to the relatively low reflection 
coefficient. of copper (0:57 at normal incidence). Pahoa 

Taking the values of p and J for four angles of incidence, 50°, 55°, 60°, 
65°, the optical constants for the layers were determined using the formule 
of Drude. The value derived for the refractive index is n=1-10-+0-07, 
and for the absorption coefficient k=2-3-+-0-1. 


Fig. 7. 
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The variation of ellipticity with incidence angle for thick copper films. 


Also included in fig. 8 are two values for 4 for evaporated layes of 
copper reported by Lowery, Wilkinson and Sware (1936). Good agreement 
exists within the present values. However the values of the optical 
constants calculated by Lowery, n—0-90, k=2:-1, for a single angle of 
incidence differ from those quoted above. 

These are in close agreement with the values quoted in the International 
Critical Tables for A=5400 A., n=1-07, k=2-25, and with those given by 
Winterbottom (1936) for bulk copper, n=1-00, k—2-40. However, as 
Lowery has demonstrated, values found from measurements on copper 
surfaces prepared by polishing are extremely susceptible to differing 
methods of surface preparation, hence the agreement is fortuitous. The 
values for electrolytically deposited copper are n=1-38, k=1-7 8, and for 


é 
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cathodically sputtered layers n—0-90, k=2-70, and so it seems that the 
layers produced by evaporation resemble the structure of the massive 
metal more closely than layers made by setae deposition or cathodic 
sputtering. 


The Optical Constants of Evaporated Layers of Tin. 

Eight evaporated layers of tin were examined at one angle of incidence 
only, 65°. The reflection coefficient at normal incidence was 0-70, and 
the layers were stable in air. 

Fig. 8. 
COPPER 


0-30 ] 


\e 54614 


° 


°o 


20 40° 60 80 
The variation of relative phase change with incidence angle for thick copper 
layers. 


The optical constants derived for a wavelength of 5400 A. were 
n=2-4+0-2, k=1-9+0-1. The accuracy of the values for the constants 
of copper cannot be claimed here since these measurements were made 
at one angle of incidence only. The optical constants for bulk tin (Hedges 
and Homer 1937) are given as n=1-0, k=4-2, and even allowing for 
considerable variations in the values for the bulk metal (tin is soft and so 
its surface would be susceptible to conditions of polish etc.) there exists 
considerable difference between the optical constants of the layers and 
of the bulk. 

The constants for the bulk metal yield theoretical values for p and 4 
of 0-82, and 0-:130A. These values were checked experimentally. Some 
plane surfaces of tin were prepared by allowing drops of the molten metal 
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to fall on a hot glass plate, and to cool there slowly. When separated 
from the glass the tin surface had the finish of the glass plate, and also was 
presumably free from any distortion due to cold working. The ellipticity 
of the light reflected from these surfaces was measured, and the relative 
phase change was determined using the interferometric technique in 
reflection. 

A photograph of the interference fringes obtained by reflection in an 
interferometer formed between a freshly prepared evaporated silver 
layer (R=0-88, T=0-06) and the cast tin surface (R=0-80) is shown in © 
fig. 9. In order to use reflection fringes in. this connection it is desirable 
to improve the visibility by polarizing the two halves of the field at _ 
right angles to one another and to observe the doubling along the line 
between the halves. The accuracy of measurement on these fringes is 
not equal to that obtainable with transmission fringes, but it is sufficient 
for the present case. 


Fig. 9. 
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Reflection fringes at 65° incidence in the surface of a cast tin specimen. 


The results of this experiment together with the experimental values 
for the layers, and the theoretical values for bulk tin are included in 
the Table below. 


TABLE. 
p A/a R n k 
Theory Bulk 0-84 0-133 ' 0-82 1-0 4-2 
Exp. Bulk (cast) 0-82-++0-01 0:13-L0-01 0-80 


Exp. Layers 0-73-40-01 0-101 -+-0-003 0:70 2-4 1-9 
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Fringes of equal thickness in an interferometer at various incidence angles, 
for reflection coefficients of 90 per cent. 
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65° 


Fringes of equal thickness in an interferometer at various incidence angles, 
for reflection coefficients of 50 per cent. 


60° 


Fringes of equal thickness in an interferometer at various incidence angles, 
for reflection coefficients of 20 per cent. 
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The surface structure of the evaporated layers of tin would thus appear 
to differ from that of the metal in bulk. 

Three possible explanations can be put foward to explain the dis- 
crepancy. First it may be that an oxide film was present on all the layers 
of tin examined, and its presence changed the optical properties of the 
layer. However, the layers were very stable in air, and the values of the 
ellipticity and relative phase change were found to be unaltered after 
several days, during which an oxide film might have been expected to 
grow and to cause further changes in the optical properties. 

The second possibility (suggested by Dr. Hedges of the Tin Research 
Institute) is that an alloying action took place between the tin and the 
molybdenum filament used for the evaporation. The percentage of 
molybdenum in the alloy would be very small, since the filament used 
was of strip 0-004 in. thick, and was not visibly affected after fourteen 
separate evaporations from it, but the possibility remains. 

A third possibility is that tin, whilst it forms a stable layer at room 
temperature, exhibiting specular reflection, being a low melting point 
layer might not form layers of a structure similar to that of the metal in 
bulk, even when they are opaque. 
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XCIV. The Magnetic Deflection of Fast Charged Particles in the 
Photographic Emulsion*. 
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[Plate XXXIV. ] 


§I. Summary. 


Electron Sensitive Nuclear Research Emulsions have been exposed 
at sea level, and at the Pic du Midi (2800 m.) in magnetic fields of the 
order of 34,000 gauss, in order to test the possibilities of curvature, within 
the emulsion itself, of the tracks of charged cosmic ray particles. 

The existence of measurable deflections of long tracks has been 
established by comparison of the results of measurements on field and no 
field plates, and the method has been applied to determine the sign of * 
particles in a group of cosmic ray phenomena in the exposures at high 
altitude. 


§ II. IntRopvcTION. 


In a dense medium such as the photographic emulsion, the multiple 
scattering of a charged particle is so high as to mask the continuous 
curvature produced by a magnetic field except under high fields and on 
long tracks. 

The angle of magnetic curvature 6,, is given by 

0.= ee Gis a aT: degrees, 
7 Pp 
where H is the magnetic field in gauss, p is the momentum in MeV./c, 
t is the track length in cms. 

The arithmetic mean angle of scattering is given by §,=100-4./tG/pB 
degrees where 100-4 is a constant calculated for the emulsion, 8 is the 
ratio of the velocity to that of light, and G is a slowly varying function 
of 6 and ¢ of the order of 4. Hence the ratio of the magnetic deviation 
to the mean scattering angle is 


6,,/9,=BH4/t/6050 G=yB/t .-. . . ss (Ql) 


* Communicated by Professor P. M. 8. Blackett, F.R.S. 

+ Centre de Physique Nucléaire, Université Libre de Bruxelles. 

t University of Manchester. 

i University of Manchester, on leave of absence from Centre National de la 
Recherche Scientifique, Paris, 
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where y, is a slowly varying function of track length and of B and is 
proportional to the field H *. 


For a field H=34,000 gauss, and for a relativistic particle of track 
length 1 cm. yp 1-46. 

In fig. 1 the sloping line shows the quantity 0,E/t=100:44/4/ tas a 
function of track length. The horizontal lines show 6, H/t for relativistic 
particles, for various field strengths. The figures show that for H= 30,000 
gauss the ratio 64/#,=1 when t“5 mm. This indicates the order of 
length of track required to separate the magnetic deflection from the 
scattering deflection for relativistic particles. For slower particles the 
length must be greater by a factor 1/8?. 


Fig. 1. 
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Plot of ,H/t as a function of track length t. The horizontal lines show 6,,E/t for 
relativistic particles. 


» In the following paragraphs we discuss how the magnetic deflection 
may be separated from the scattering and used to determine the 
sign of the particle, to improve the estimate of momentum, and to 
estimate the ratio of deflection to scattering Byyv/t. Since yo is nearly 
independent of the type of particle and of its momentum, its mean value 
for a number of tracks may be regarded as a test of the scattering theory, 
or as a calibration of the plates. 


* This definition of 6, and of y, differs from that of Moyal (1950) by a factor 
4/7/2 since 6, is the root mean square scattering angle in his paper. 
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§ III]. MetHops oF ANALYSIS. 


(1) Estimation of the Curvature. 


To measure magnetic deflection and scattering we use the customary 
method of dividing the track into a set of equal cells, and we measure the 
direction 0, of the track in the ith cell with respect to an arbitrary origin. 
The deviation €,—0,,,—0,; between two successive tangents to the track 
is the sum of a constant magnetic deflection and a random scattering 
deflection with zero mean. 

Except at highest momenta we may neglect measurement errors for 
the present discussion. If we assume the distribution of scattering to 
follow a Gaussian law, the best estimate for the curvature is the sum of 


n 
this set of independent measured deviations 2€,. 


1 

This sum is equal to the total deviation between tangents at the two 
ends of the track * 6=(8,— 4). 

This result is confirmed by a more refined analysis, taking into account 
small measurement errors (Moyal 1950) or using sagitta measurements 
(Scott 1949). 

The actual distribution of scattering is closely similar to a Gaussian 
distribution, but has a rather large “single scattering tail’’. The effects 
of this tail are most simply eliminated by subtracting large deviations 
from the total deflection in estimating the curvature. 

The standard error on the estimate of the magnetic deflection 6,, from 
the total deviation ¢ is the sum of the independent errors, the scattering 
error (47)*@, and the measurement error s, on each of the first and last 
settings. 


i.e. 0,,=b+ (4797+ 28?)!. 
In most cases 283 < 4767, so we may write 
On =P ($77) *,. 


The relative error on the determination of the momentum p from the 
magnetic deflection is therefore approximately 


e(p) <(0,,) G. 7 / 
Dp = Bin we J3 : YBYoV 2 ; ; ; . (2) 


$m Ss 

* This result depends on the statistical independence: of the successive 
deflections €;. Ifthe track were a perfect arc, and measurement errors were the 
only cause of scattering, the actual observations 6;, and not the deflections é; 
would be independent. The best estimate for the curvature would then be given 


ie : 2 "2 
by the mean value of the deflection over half track distances, ¥ ( 0n, .—6; 

i . . . . 3 1 he 
since this statistic involves each of the x independent observations with equal 
weight. These conditions are encountered in cloud-chamber measurements 
on particles of high momentum, and may exist for such particles in the plate 
if higher fields become available. 
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The errors on the determination of momentum from scattering are 
discussed in paragraph (5)..The variation of these errors with energy of 


the particle is shown in fig. 7, where a Vt is plotted against pf for 


typical values of the measurement errors. The horizontal lines show 
the relative error from magnetic curvature, together with the form of the 


curves for protons and 7 mesons, allowing for their becoming non. 
relativistic. 


Fig. 2. 
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The proportion S per cent of particles of track length t, which, owing to multiple 
scattering, will give a total deflection in the wrong sense, as a function of 
t for various field strengths. 


It can be seen that over wide ranges of energy the momentum is 
determined with greater precision from scattering observations than from 
magnetic deflection in moderate fields. However, for very high fields 
(of the order of 100,000 gauss), and for high momentum particles, the 
estimate from magnetic deflection is more accurate. 


(2) Determination of the sign of particles. 


In many tracks from cosmic-ray events the direction of motion of the 
particle is known. The determination of its sign then depends simply 
on the sense of its deflection in the magnetic field. Owing to the super- 
posed scattering there exists a certain probability that a particle will 


4B2 
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have a total deflection in the wrong sense. This probability is a function 
of track length and of magnetic field, and is plotted in fig. 2. Here it 
will be noticed that for a track of 1 cm. and a field of 34,000 gauss, 
the probability is 12 per cent. 

If, however, the momentum can be determined with fair accuracy 
from scattering measurements, or from other considerations, we may 
frequently assess the sign with greater accuracy. For consider fig. 3. 
Here are shown two distributions representing the probability distribu-. 
tion for the total deflection ¢ if the particle has a positive or negative 
sign respectively, the mean value 0, being predetermined from the 
knowledge of the momentum. If we obscene for some particular track, 
a total deviation ¢, then clearly the ratio of the probabilities of the sign 
being positive or negative is given by the ratio of the ordinates of the two 
distributions at the position 4, 2. e. 


P(+|¢) _ 
P(—1d) —AN/BN. 


Fig. 3. 


+5 


= -5 = 
10 en ae. 


The probability distributions for the total deviation for a positive or negative 
particle whose mean is known from scattering measurements to be 0, . 


This assumes that a priori there are equal chances of the particle 
having either sign. A more rigorous analysis of the situation is given 
by Moyal. With p(+-)/p(—) the ratio of the a priori sign probabilities, 
his conclusion is expressed by 


+ |) ={14 +E exp — Eyl a 2 LS) 


where y= $/0,4/ ¢ is the estimated value of yp. 
In the present experiment we assume the ratio p(-++)/p(—) to be 
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always unity. Fig. 4 shows the proportion of positive particles of length 
¢ which will yield a positive sign with probability P (+ |%) greater than 
the fixed values marked on the curves for a field of 34,000 gauss. 

It should be pointed out that in the event of a track occurring during 
the inevitable “no-field time’ and being believed to be a field track, 
one will, in general, attribute a sign with a probability greater than 50 per 
cent. The fraction of no field tracks which would give sign probabilities 
greater than the same fixed values as in fig. 4 is shown in fig. 5. It will 


Fig. 4. 
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90 
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The proportion of particles, of track length ¢, which will give the correct sign 
with probabilities greater than R per cent for H=34,000 gauss and for 
various values of R, as a function of t. 


be seen that this fraction can be rather high for long tracks. For this 
reason it is important to reduce to a minimum the no-field time, and to 
keep the plates horizontal while out of the field. 


(3) Hstimation of Scattering. 

To estimate the multiple scattering, the direction of the mean line 
through the grains in each cell is carefully measured, using an accurate 
eyepiece goniometer, fitted with a suitable graticule™. 


* The method of measurement of sagitta, described recently by Fowler (1950), 
could also be used for these measurements. We are now investigating the 
advantages of the two methods. 
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Two methods are available for the calculation of the momentum from 
the measured angles. Hither the arithmetic mean deviation, or the root 
mean square is calculated. In either case the effects of the single 
scattering tail are eliminated by applying suitable weighting functions, 
as for example, for the mean deviation, a sharp cut off of all angles greater 
than four times the mean. Such a sharp cut off can be shown to introduce 
a much greater uncertainty in the root mean square than in the mean 
deviation. 


Fig. 5. 
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The proportion of particles of track length ¢ which, in the absence of a field, 
may be expected to yield a sign probability (correct or incorrect) greater 
than R per cent for various values of R. The proportion which will yield 
the incorrect sign with these probabilities is, of course, just one-half those 
given by these curves. 


(4) Determination of the Errors. 


Errors of two types occur in the measurements of multiple scattering 
and of the total deflection. 

(a) Distortion of the emulsion and inaccuracy of the microscope stage. 

(6) Measurement errors, which we call spurious scattering or noise ; 
statistical errors, due to the finite number of cells available ; and an 
uncertain amount of “ smoothing’’, a systematic reduction in scattering 


angles due to the treatment of the track as a straight line within each 
cell. 
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The first type of error is more serious for determination of the magneti¢ 
deflection, the second_in the determination of multiple scattering. 
The emulsion, especially when wet, is easily disiorted. Even slight 
distortions could limit the measurements to particles of low momentum. 
Their effects can be reduced by using thin emulsions, but only at the 
expense of the frequency of long tracks. Thin plates were used for the 
sea level exposure, but recent advances in the processing of thick emulsions 
(Dilworth e¢ al. 1948 and 1950) and in the measurement of distortion 
(Cosyns et al. 1950) made possible the use of thicker emulsions for some of 
the later experiments. Measurements of distortion on both sets of plates 
are described in §IV., where it is shown that they have been reduced 
to a satisfactory level. For the measurements on high energy tracks the 
movement of the microscope stage is controlled throughout the measure- 
ments by an optical method. Both Cooke and Baker microscopes show 
a deflection less than 0-4’ over a displacement of 20 mm. 

The determination of the second type of error depends on the method 
of analysis used. 

In using the root mean square determination of scattering, systematic 
effects of smoothing and of spurious scattering may be eliminated by the 
method suggested by Moyal. He points out that true and spurious 
scattering have a different statistical nature. True scattering yields a 
set of independent deflections €, while the presence of measurement 
errors on each setting @; is shown to introduce a negative correlation 
between the successive €,'s. On the other hand, smoothing introduces 
a positive correlation of a similar nature. Both effects are eliminated 
on the average by using as an estimate for the scattering 


nG?—=(LE—d?/n)+2(26,6,,,-¢2/m). 2. (4) 


The terms containing the total deviation ¢ are small corrections to 
allow for the mean deflection, and the second bracket term is a correction 
to the sum of squares to allow for the combined effects of noise and 
smoothing. 

It is shown by Moyal that neglecting fie departure of the scattering 
law from the Gaussian Law, the relative error on this estimate of §° 


is given by 
4w \2 \4 ¥ 
e(07)/627=(2/n)t (4 ) Bo ete Sart forse 
re} 
where w is the mean value of (X€,£,,,—¢?/n). 

Alternatively, using the arithmetic mean as an estimate for @, the 
systematic effects of noise and smoothing are eliminated by the methods 
described by Goldschmidt-Clermont et al. (1948). 

Smoothing is taken into account by using the angle between cells 
separated by one cell distance, 7. e. (0;,,—0;)=(€;+ §;,1), and then increas- 
ing the measured scattering by 4+2 per cent*. The noise level for the 


_ * Tf this method were ce to the root mean square estimate we would 
have to calculate 42(é,+ €;,1)?=2€2426éé;,1. The second term is just half 
the correction suggested by Moyal. 
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track is not known, but is estimated from measurements on similar tracks 
of such high momentum that the true scattering is negligible, and the 
observed scattering is reduced accordingly. 
The (n—1) measured angles are not statistically independent. This 
effect is allowed for by treating 3 of the observations as independent. 
Corrected for the effect of cutting off large angles, the statistical error 1s 


0-54 [2(n—1)]-#. 


Ss Fig. 6. 
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The experimentally observed spurious scattering per cell as a function of cell 
length for single settings and for multiple settings. 


(5) Choice of Cell Length. 


The choice of cell length results from a compromise between two 
conflicting requirements: a short cell reduces the statistical errors while 
a long cell decreases the noise level. The optimum value is that which 
renders minimum the total error and can be calculated from the empirical 
law of variation of noise level with cell length. Measurements have been 
made on high energy tracks to determine the magnitude of the spurious 
scattering and its approximate variation with cell length. 

Analysis of the experimental results shows that the noise may be 
divided in two parts : 
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(1) One component, which we may call setting error, is introduced by 
the inability of the observer to set the line of the graticule accurately 
along the mean line of the grains. This error can be determined by 
repeated settings on a cell, and may be reduced to as low a level as is 
required by averaging a suificiently large number of settings. 

(2) The second and more fundamental source of spurious scattering is 
the spread of the grains about the true direction of the track. The mean 
angle « between the track and the best mean line which can be drawn 


Fig. 7. 
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The relative error of estimation of energy at-optimum cell length from scattering 
using single and multiple settings as a function of energy. The auxiliary 
lines show the error in the estimate of momentum from magnetic deflection 
for electrons, protons and 7 mesons, given by equation (2). 


through the grains can be calculated by supposing the grains to lie at 
random on either side of the track and at a mean distance r from it. It 
is found that | 
oe Ay 
Ca Jd 


h-3?, 


Where d is. the grain density, and h the cell length; r may be assumed 
to be of the order of the radius of a grain, and A~™1. 

Fig. 6 shows. the observed variation of noise level with cell length. 
The upper curve gives the spurious scattering for single settings, the 
lower one that for averages of several settings. The number of settings 
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to be averaged increases as the cell length increases. The lower curve 
may be assumed to represent the noise level arising from the second 
source, It will be seen that it is less than 1’ for cell length of 200-p. 

The value of the optimum cell length depends on the type of analysis 
used. With Moyal’s method, where the noise level is calculated from the 
set of observations, it can be shown that the minimum error is obtained 
when the total mean square scattering is the sum of two nearly equal 
terms due to noise and true scattering 
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The variation of total error of momentum estimate by scattering with cell 
length. 


where p is a constant of the order of 0-5. The values calculated for the 
total error for a single setting and for multiple settings are presented in 
fig. 7 as a function of momentum. 

Fig. 8 shows the theoretical variation with cell length of total errors. 
Owing to the rapid variation of spurious scattering with cell length the 
total error is seriously increased by using too short a cell length. On the 
contrary, the use of somewhat too large a cell length will increase it only 
slightly. It will be seen from the figure that the use of a cell length equal | 
to twice optimum increases the error by only 20 per cent. 
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For the alternative method of analysis it is necessary to use a somewhat 
greater cell length, since the effect of setting errors is estimated from 
measurements on other tracks, and we require that the uncertainty 
introduced should be small. As shown, however, this does not seriously 
increase the total error. 


SIV. EXPERIMENTAL MeErHops. 
(1) Huposure and Processing of the Plates. 
(A) Sea level plates. 


The photographic plates were exposed in a vertical position in an 
electromagnet being built for the University of Colombo, Ceylon, at the 
firm of Metropolitan Vickers Ltd., Manchester. The electromagnet was 
similar in design to that described by Blackett (1934) for. use in cloud 
chamber studies of cosmic ray events, having poles of 10 in. diameter. 
For the present experiments the poles were coned down at an angle of 
60° with good quality high permeability steel (kindly supplied by 
Metropolitan Vickers Ltd.) to a diameter of Lin. The gap used was 1 cm. 
With this arrangement and a power consumption of 6 kilowatts the 
field was 34,000 gauss. 

The plates were Kodak NT4 plates 200 1 thick emulsions and 2 in. x 2 in. 
dimensions. They were carried in a horizontal position from London as 
soon as they were dry, and two were put in the magnetic field. Two 
other plates were kept in a vertical position in the Physical Laboratories, 
Manchester University, as controls. Both batches of plates were exposed 
for 20 days and then developed together. The method of ‘‘ Temperature 
Development”’ was used, the developer being Azol. Great care was taken 
in treatment of the emulsions, especially in the drying stages, to avoid 
distortion. The plates were processed in a horizontal position. 


(B) Pic du Midi. 

These plates were exposed, also in a vertical position, in the field of the 
Blackett magnet which has recently been transported to the Pic du Midi 
(2800 m.). The pole pieces were 74 cm. diameter, made of soft iron with 
permendur tips and gap about 1 cm. With a power consumption of 
12 kilowatts the field was 34,000 gauss. Ilford G.5 plates, 200, 400 and 600 
microns thick and 5 inches square were used. 

The large area was taken to make the maximum use of the field and also 
to reduce distortion due to edge effects. The plates were dried horizontally 
at Ilford Ltd., to avoid stresses being introduced into the emulsion, 
which can later give rise to distortion. They were brought to Pic du Midi 
in a horizontal position to reduce the number of long no-field tracks, 
exposed for three weeks, and then brought back for development, again 
horizontally, and at low temperature to reduce fading. ° 

Development was by Temperature Development, using amidol at low 
pH (Dulworth et al 1950). Great care was taken to avoid sudden changes 
of temperature, and of pH. Fixing was carried out at low temperature 
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(10 to 14° C.), with as little agitation as possible consistent with efficient 
fixing. The plates were washed in a slow non-turbulent stream of water. 
During the drying the plates were surrounded with a ~ guard ring ” of 
wet gelatine plates; the whole placed on the glass bottom of an enclosed 
box, in which the humidity was regulated by the use of a “chimney”. 
The plates were gently warmed by an infra-red lamp placed at some 
distance below the box. In this way the plates were maintained as far as 
possible with equal conditions of humidity all over the surface. 


(2) Measurements on the Tracks. 
(i) Scanning. 

Sea level plates.—The plates were carefully scanned on a Cooke M4000 
binocular microscope fitted with a Cooke 45. fluorite oil immersion 
objective and a pair of 8x compensating eyepieces. Many long straight- 
tracks, mostly due to cosmic-ray mesons, were found. Some 30 tracks of 
approximately 1 cm. in length were found within the area covered by the 
high magnetic field, and a similar number of long tracks were found in the 
control plates. 

There were fewer low-energy tracks on the no-field than on the field 
plates, probably because the former were kept inside a building with thick 
walls and floors. 

Pic du Midi plates —These plates were scanned on both Cooke M4000 
and Baker 4BQ microscopes, using either Cooke 20x dry objective or 
Cooke 45x fluorite oil objective, and a pair of either 10 or 8x com- 
pensating eyepieces. Among the events noted those were. chosen for 
measurement which possessed tracks of sufficient length. In 30 cm.? of 
one 600 plate 40 tracks were found suitable for measurement. This does 
not include tracks of electron pairs, which are only found consistently 
with more careful scanning. 

(ii) Measurements. 

The method of measurement of scattering and of deflection has been 
mentioned in §III., and is described elsewhere (Y. Goldschmidt- Clermont 
1950). 

To define the direction of the track accurately we require a very fine 
straight line. Following the procedure of Braddick (1950) we have 
produced graticules consisting of one line, or of two parallel lines about 
1 mm. apart, by ruling with a fine point on a photographic plate. Each 
line is of width about 0-01 mm. These have proved very suitable for this 
work. The grains are set symmetrically along the single line or down the 
space between the two lines. Different cell lengths are obtained by using 
eyepiece stops, and changing the magnification. 

On each plate the distortion has been measured by the method of 
Cosyns and Vanderhaeghe, using measurements on steeply dipping 
tracks, and corrections applied to each track according to its dip and its 
orientation with respect to the distortion vector. As an additional check 
in the case of the sea level plates no-field control plates have been 
examined. 


Fast Charged Particles in the Photographic Emulsion 1045 


On these plates 16 tracks are deflected by less than 0-1° over distances 
of the order of 1 em. One track in this group ran for 5 cm., and 
disregarding the 0-5 cm. near each end where it is in the danger zone 
near the edge of the plate, the deflection was 0-05°. Corresponding figures 
for the high energy particles in the field plates are that eleven such 
tracks were deflected by less than 0-1°. 

Direct measurements have been made on the Pic du Midi plates. The 
distortion is, of course, least on long tracks, corresponding to their small 
dip. As examples of the total distortion over a whole track length ~lem., 
we found 2’ in one case and 25’ in another. 

In many cases it has been possible to identify a particle by comparison 
of observations on che grain density and scattering. (Camerini et al. 1949, 
Cosyns 1949, Fowler 1950, Goldschmidt-Clermont 1950, Occhialini 1949). 


§V. EXPERIMENTAL RESULTS. 
(1) Determination of the Curvature. ) 
Eleven tracks on the field plates and eight tracks on the no-field control 
plates have been examined and analysed by Moyal’s method. The results 
for these tracks are presented in Tables I. and II. respectively. In figs. 9 
and 10 are presented detailed plots of the tracks. 
The values of the ratio 


y=$10,v/t 
have been ¢alculated for both types of track. For no-field tracks these 


should be distributed normally about zero mean. The standard deviation 
of the distribution is given by 


2 4 e2(0,))# 
€(y)= fait tee 


the second term in the bracket is frequently small. 

For field tracks it should have mean value y, and the same standard 
deviation. 

The weighted mean value of | y| obtained from the no-field tracks is 


| | =0-49-+0-39. 
For the field tracks the value | y| is found to be 
| y| =1-96-40-44. 


The difference is 1-47-L0-59. 

It seems that the presence of a significant effect from the magnetic 
field is established. To show clearly the effect we have plotted in fig. 11 
the value of g=y,./t which is subject to a standard error approximately 
+,4/(7/2) against the square root of track length \/t, field and no field 
tracks being plotted on the same diagram. 

The mean value of |y|for the field tracks is consistent with the 
calculated theoretical value y)=1-46 within the rather wide errors. 
Many more tracks will have to be examined before it will be possible to 
give an experimental calibration of the scattering by this method, 
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The correct sign probability has-been calculated for each track from 
equation (4), and is also presented in Table I. It will be noticed that owing 
to the generally high values of y observed, leading to the mean somewhat 
above yp—1-46, these probabilities are quite high. 
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Detailed plot of eleven field tracks arranged in order of their momenta, 
and corresponding to the eleven tracks in Table I. Tracks 3 and 4 are 
two halves of the same track. For clarity negative deflections have been 
inverted to prevent crossing of the lines. 


A discussion of the actual significance of the observed deviation is 
a little difficult, for clearly both the deflected and the undeflected tracks 
have mean deflection zero if we disregard the possible positive (or 
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negative) excess. On the other hand, a presentation in terms of the 
modulus of the values of y does not give an average which has any real 
meaning since the distribution about the mean is a function of track 
length. One way of presenting the data which seems reasonable is to 
calculate for each track the quantity {y/e(y)}. For the no-field tracks 
this should be distributed about zero with unit standard deviation. 
» It follows that the sum of squares of terms such as D{y/e(y)}? should be 
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Detailed plot of nine no-field tracks. The first eight are those listed in Table IT., 
the remaining one being a very high energy track, which is included to show 
the low level of distortion. 


distributed according to the so-called xy? distribution, and the pro- 
bability of the observed value being exceeded is easily found from tables 
of this distribution. The same analysis applied to the field tracks will 
then show whether the observed set of values of y are consistent with 
the hypothesis that they belong to an undeflected (no-field) distribution. 
For the eleven field tracks the value was 


Atyle(y)PP=22°5 
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and reference to tables shows that this value is exceeded with a probability 
of only 3 per cent in a random sample from a normal distribution. with 
zero mean. It follows that the observed deviations are not consistent 
with the hypothesis that they represent merely a fluctuation from a set 
of undeflected tracks. 

- Moyal’s statistic can give a negative value for the true scattering 
where the total relative error is large. This happened for some of the 
no-field tracks since most of them were of rather high energy, and these 
were discarded. Tracks of similar energy for which the calculation gave 
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The observed ratio g=4/0,, of the total deviation ¢ to the mean scattering 
deviation 6,, for eleven field tracks € and eight no-field tracks % as a 


function of »/t. The straight line is the line g=y,1/t for yp=1-46. 


a positive true scattering would be accepted. This introduces a bias 
tending to underestimate the average energy of the tracks. When the 
test is applied we are, therefore, not surprised to find that the values 
of y observed are significantly small. The value obtained for the eight 
no-field tracks was y?=2-16. For arandom sample there is a probability 
of 7 per cent that x? should be smaller than this. Operation of a similar 
bias in the case of field tracks could only have reduced the calculated 
significance, so that our evaluation of it is a lower limit. 
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(2) The Determination of the Sign of some Particles in Cosmic Ray Events. 


In the plates exposed at high altitude measurements have been made 
on several tracks of which in some cases the sign and the direction of 
motion can be deduced from the nature of the event. In others, such as 
electron pairs, conservation of charge must be maintained. These two 


Fig. 12 
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Diagramatic representation of the results presented in detail in Table II 


groups can eventually be used as a verification of the efficiency of th 

method when sufficient statistics are available. When either or both the 
sign and the direction of motion of the particle are unknown a pri ‘ 
the measurement -of the sense of the deflection can be used in the a ae 


of the event. nalysis 
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These tracks are listed in Table III. and sketched diagrammatically in 
figs. 12 and 13. ~ 

Track 1, identified as a proton, links two stars, one of which is evidently 
of higher energy and is therefore assumed to be the primary. This gives 
the direction of motion of the particle. Its sign is found to be positive 
with a probability of 95 per cent. This event is shown in the mosaic of 
Pl. XXXIV. 


Fig. 13 
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Diagramatic representation of the results presented in detail in Table II. 


Track 2, also a proton, gives rise to a knock-on electron, from which 
we can deduce the direction of motion. There is a probability of 93 per 
cent that the sign of this proton should be positive. 
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A branch of a star, track 3, is identified as a proton, its orientation and 
energy indicate that it comes from the star. It is found to be positive 
with a probability of 95 per cent. 

Another proton, track 4, is the only high energy branch of a 4-branch 
star and the only one near the vertical. If it is the primary of the star 
it is positive, with a probability of 69 per cent. 

Event 5 is the electron of a 7—--e decay, it is found to be positive with 
a probability of 62 per cent. It was found in a corner of the plate, where 
the field was only 25,000 gauss. ; 

Next there is a group of y-ray pairs, of which the particles have energies 
ranging from 29 to 220 MeV. In each case one track of the pair is found to 
be positive, and the other negative, as should be expected. 

Event 10 is a “ giraffe” (G. Occhialini 1949). Since this is the direct 
production of a positron-electron pair by a charged primary, we expect 
to find the charge of the primary conserved among the three secondary 
particles. In fact, we find the primary to be positive with 83 per cent 
probability ; the secondaries to be two positive, one with 78 per cent 
probability, the other with 70 per cent probability; and the third negative 
with 96 per cent probability. There is also a rough energy balance, within 
the errors of measurement. 

Event 11 seems to be a proton—proton collision at high energy. If 
this is so, the track coming from the vertical, a proton of 150 MeV. energy, 
is positive with a probability of 59 per cent; the continuation and recoil 
tracks are positive, one with 67 per cent and the other with 61 per cent 
probability. The angle of collision is 90°, the energy of the primary is 
divided between the two secondary tracks in a ratio in agreement with the 
angles of deflection. 

Track 12 is of a meson which, by the increase of its grain density and 
scattering, can be seen to be approaching the end of its range. Its average 
energy is 20 MeV., its sign negative with a probability of 88 per cent. 

The orientation of track no. 13, a meson by its grain density and 
scattering, is such as to make it likely that it comes out of the star; if 
this is so it is negative, the probability of the deflection being 81 per cent. 

Track 14, also a meson, is moving nearly horizontally, but its energy 
is so low that it seems unlikely to be the primary of so large a star. Taking 
it to be a secondary, it is positive with 73 per cent probability. 

In event 15 it is difficult to tell whether the track is a primary or 
secondary from either its orientation or energy. Since, however, it can 
be identified as a proton, and the deflection is such as to make it 70 per 
cent positive if it leaves the star, we accept it as a secondary. 

Track 16 seems, by orientation and energy, to be the primary of a star 
of 10 low energy branches; it is a meson and has 62 per cent probability 
of being negative. 

Event 17 is a star of many low energy branches, with one higher energy 
track coming from the vertical. Its energy and grain density indicate that 
it is a proton. It is positive with a probability of 98 per cent. 

The average value of y deduced from these measurements is 1-56-33. 
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§VI. Discusston. 

The results obtained_seem to show that the use of a magnetic field to 
deflect tracks in the photographic plate can already contribute sub- 
stantially to the information available about the cosmic ray events 
recorded. We discuss here the improvements which might be made to 
the technique in order to exploit to the full its possibilities. 

Favourable conditions for the determination of the sign and momentum 
are a large value of the product y8/t and a low level of distortion in the 
emulsion. y) can be increased by reducing the scattering constant of the 
emulsion. Already, as announced by Dr. Berriman of Kodak Ltd., and 
by Mr. Dodd and Mr. Waller of Ilford Ltd., at the Bristol Photographic 
Conference (May 1950), it has been possible to produce diluted emulsions 
containing a smaller proportion of silver bromide. An increase of 16 per 
cent in the theoretical value of y, is obtained without loss of sensitivity. 

While such plates present new problems in processing, the advantage 
of increasing yy in this way is such as to warrant efforts to overcome the 
difficulties and to find emulsions with still smaller proportions of heavy 
elements. 

The constant y, may also be increased by increasing the field H, the 
mean track length by increasing the emulsion thickness, d. It can be 
shown that the number of tracks per unit area of plate for which y9v/t 
has a certain minimum value is proportional to H? and to d?. In this sense, 
therefore, an increase in the thickness is as valuable as an increase in 
field. Moreover, while it is often of interest to examine single measurable 
tracks which emerge from a star, it is generally desirable to measure 
several tracks from the same event. If the directions of the tracks are 
independent, it can be shown that the number of events per unit area 
from which n tracks are all measurable is proportional to H®” and to 
ar, 

For these reasons, throughout the experiments we have advanced 
cautiously towards thicker emulsions, and we are now scanning a batch 
of plates 1200 microns in thickness which show very little distortion. 
The table includes events taken from one of these plates. Further 
advances towards thicker emulsions are still possible. 

The main difficulty in increasing the field is the high cost. Fields of 
100,000 gauss or more are, in principle, quite possible, and the design of 
such a magnet is being investigated. Thick emulsions are more economical 
since they reduce the proportion of the field volume which is occupied 
by glass. 

At the very highest momenta even the little information available 
about momentum from the magnetic deflection may be of value. The 
relative error of the momentum estimate by the magnetic deflection 
remains independent of momentum until the distortion becomes important. 
The distortion over a 1 cm. track in a 600 plate has been shown in 
many cases to be as low as 1’, and a correction may be applied even for 
this. We regard the upper limit of detectable momentum with this length 
of track as fixed by this angle, corresponding, in a field of 34,000 gauss, 
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to amomentum of 3-5 10%eV./c. This maximum is roughly proportional 
to track length and to the field. It is conceivable that measurements 
in the region above 10” eV./c may be possible if fields of the order of 
100,000 gauss or more become available. 

Fields of this magnitude will also make possible a- determination 
of the mass of particles of velocity appreciably less than that of light, by 
comparison between scattering and magnetic deflection, which yields a 
value of 8. Determination of mass by this method has recently been 
described for very low energy light particles in the cloud chamber 
(Groetzinger et al. 1950). 
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Mosaic of event 1 of Table III. The track linking the two stars is shown to be 
positive with a probability 95 per cent, the particle being a proton. The 
direction of motion of the particle is determined from considerations of the 
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XCV. The Momentum and Sign of Fast Cosmic Ray Particles. 


By J. E. Moyat, . 
Department of Mathematics, The University, Manchester ee 


SUMMARY. 


In order to study the statistical estimation of momentum and sign of fast 
cosmic ray particles from their magnetically deflected and multiply scattered 
tracks in the photographic emulsion, the probability distribution of the observed 
angular deviations along the track is derived, taking into account errors of 
observations. With the help of this distribution, approximate expressions are 
obtained for the maximum likelihood estimates of the magnetic deviation and 
the mean-square scattering deviation. The standard errors of these estimates 
are found, and their efficiency is discussed. 

After showing how the observations of magnetic and scattering deviations 
in a large number of tracks can be combined to verify theoretical mean-square 
scattering formule, the estimation of the momentum p of particles from these 
deviations is discussed. An expression is derived for the value of the optimum 
cell-length, at which the error of the scattering determination of p is minimized. 
The relative errors of the magnetic and scattering determinations of p are 
compared at this optimum cell-length : it is found that there is a critical value 
of p, depending on the field, below which the scattering determination is more 
accurate. 

An expression is given for the probability that, given the signed total deviation 
of a track in the magnetic field, the particle should be positively or negatively 
charged. A method is given of estimating the ratio of positively to negatively 
charged particles in terms of the observed signed deviations in a number of 
tracks. 


§1. InTRopUCTION. 


Fast charged particles passing through the emulsion of photographic 
plates suffer multiple scattering due to elastic collisions with the nuclei 
in the emulsion. Ifa uniform magnetic field is applied normal to the plate, 
a constant angular deviation per unit track length is superimposed on 
this random scattering. Estimates of the momentum of a particle may 
be obtained either from the mean-square angle of scattering (cf. 
Goldschmidt-Clermont et al. 1948), or from the mean curvature due to 
magnetic forces ; estimates of its sign, however, are only possible from the 
magnetic deviation, provided, of course, we know the direction of travel 
of the particle when it enters the field. 

As the expected mean values of magnetic deviation due to fields of 
20,000 to 250,000 gauss is of the same order of magnitude as the scattering 
angle and the spurious apparent deviations due to finite grain size in the 
emulsion, errors of observation, etc., the statistical problem of estimation 
from photographic plate data becomes important. The purpose of the 


present paper is to develop the theory of multiple scattering with a 
ee ee 
* Communicated by Professor P. M. S. Blackett, F.R.S. 
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superimposed magnetic deviation in a form suitable for the study of 
estimates of the signs and momenta of particles from data contained in 
their tracks, and to evaluate the relative merits of such estimates and 
the errors affecting them. 


§2. Motion oF Muurrety Scarrerep ParricLes In A UNIFORM 
Maanetic FIEevp. 


In this section the theory of multiple scattering is expressed in a 
form convenient for the solution of statistical estimation problems. 
We shall be concerned with the “ true ” motion only: the effects of errors 
of observation will be considered in the next section. We limit 
ourselves to particles fast enough for the loss of energy to be negligible 
in any given track, so that the velocity B® has very approximately a 
constant magnitude f ; the letter ¢ will therefore be used to denote both 
time divided by 8 and distance along the track. We shall use the units 
common in cosmic ray work : velocity will be measured in multiples of 
the velocity of light c, charge in multiples of the electron charge, energy 
in electron-volts, momentum in eV./c., length in cm.; magnetic fields, 
however, will be in gauss (a unit consistent with the previous ones would 
be V/cm.c.). 

The quantum mechanical treatment of scattering by nuclei of relativ- 
istic charged particles, such that nuclear interactions and the recoil 
of nuclei can be neglected, gives the following results (Williams 1939 ; 
cf. also Moliére 1947) : 

(1) The cross-section for single scattering is very approximately the 
Coulomb cross-section modified to account for screening and finite size 
of the nucleus. Roughly speaking, the effect of screening is to introduce 
a lower limit to the scattering angle, while the finite size of the nucleus 
introduces an upper limit. 

(2) Successive collisions may be treated as independent events; in 
a dense scatterer, the distribution of the total angle through which 
a particle scatters in an interval 5¢ long enough for a large number N of 
collisions to occur is, therefore, the resultant of the composition of N 
independent modified Coulomb scattering distributions. The mean- 
square of the angular deviation 5¢ projected on a plane is then shown to be 
proportional to dt: 


| Spt tS ae Ca ae oe) 
ome 


(the bar over a symbol denotes its expectation value) ; the mean-square 
deviation per unit track s? is shown to be inversely proportional to the 
square of the momentum 9, the coefficient v depending only on the properties 
of the scatterer. For either Ilford G5 or Kodak NT4 electron sensitive 
plates the calculation of v on the basis of Williams’ approximate formule 
gives v=8-48 x 10 (with 5¢ in radians). As 6¢ increases, the distribution 
of 8¢ tends rapidly to the Gaussian in its central part, but maintains a 
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longer “tail”? to begin with, due to an appreciable probability of 
occasional close collisions with large scattering (cf. Williams, loc. cit. ; 
see also Goudsmit and Saunderson (1940), Moliére (1948), Snyder and 
Scott (1949), and Lewis (1950) for a more complete treatment; the 
application to photographic plates is considered by Goldschmidt- 
Clermont (1950). In particular, for the order of intervals and range of 
momenta of interest in this paper, one may assume (as we shall. do 
henceforth) a Gaussian distribution for 54, provided one excludes these 
occasional large-angle single scatterings from the observations, e.g. 
by ‘cutting’ the observed distribution at its mean + 3 times its 
standard deviation™. 

What we shall require for later work is the joint distribution of angular 
deviations and coordinates of particles for tracks selected to lie 
approximately in a plane (the emulsion plane). This may be obtained 
by solving the appropriate “‘ diffusion ’’ equation which this distribution 
function obeys (cf. Fermi, as quoted by Rossi and Greisen (1941)); it 
is convenient, however, when dealing with the estimation problem, to 
treat directly the equations of motion for the angles and coordinates 
themselves +. Let then 0, ¢ be the polar angles defining the direction of the 
particle velocity B, with the emulsion in the equatorial or (x, y) plane. 
The relativistic equations of motion for a multiply scattered particle in 
the presence of a uniform magnetic field H normal to the emulsion plane 
lead to the following expressions for the increments 60, 5¢ of the polar 
angles in a small interval 6¢ : : 


d0=6@; sin 06¢6=w sin 0+89, ee re at es 
where 


300 eH 
oOo 
Pp 


is the magnetic deviation per unit track (e being the charge, p the 
momentum of the particle), and 50, 5® are the projections of the 
scattering angular deviations in d¢ on two orthogonal planes containing 8, 
the first containing the z-axis as well. Owing to the symmetry of the 
scattering distribution about the velocity at the beginning of the interval, 


60 and 8® must be independent with zero means and equal mean squares, 
so that 


(3) 


SO2—8S—= 8... Sk. A) 
The formal integration of (2) from time 0 to t gives 


$O=dtot4 | SA, 


re | 
* See Dilworth et. al. (1950) for further discussion of effect of cut-off. 
} These correspond to the Langevin equations for the Brownian motion, 


while the diffusion equation corresponds to the Fokker-Planck ti 
e.g. Moyal 1949). i set en SAI (eS) 
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where 9 is the azimuth at t=0. Since we are selecting tracks lying 


approximately in the (x, y) plane, @= @ +e) , where «<5° say. Neglecting, 


then, terms of order <2, we have 


t 
HO) =Fo-tat+ | IP) = hott +O). 6 6 6. (5) 
The velocity components along the x- and y-axes respectively are 


B.,(t)/B=cos 6=cos (f9-+at-+) = cos y>—(wt+) sin do, 6) 
B,(t)/B= sin d=sin ($9+at+ ®) = sin $+ (wt+) cos go, 


where the last approximate expressions in the right-hand sides of 
equations (6) are valid as long as both magnetic and scattering deviations 
remain small (both s1/¢ and wt<5° say). Integrating, we obtain for the 
coordinates of the particle 


t 
x(t)=ay-+t cos do—Jut? sin ¢)— sin bo | (t—7)dG(r), 
0 
t 
y(t)=Yo +t sin Jo +4wt? cos dg+ cos oo | (t—r)d®(r), 
0 
where Xp, Yo are the coordinates at t=0. 

Equations (5) and (7) are valid in the small angle approximation, 
whatever the distribution of 66. If the latter is Gaussian, then it can be 
shown (see e.g. Moyal 1949) that the joint distribution of ¢, x and y 
will be of the multivariate Gaussian type, whose general expression for n 
variables z,, . . . , 2, (which we shall require again later) may be written 
succinctly as follows (cf. Kendall 1946 or Cramer 1946): let the vector z 
stand for the n variables, m for their means, A for their covariance matrix, 
whose elements are 


Aig=(2;—) (2;—M)), 
A- for the inverse of A and A, for its determinant ; the probability 
that z should lie in the n-dimensional volume element dz, . . . dz, is then 
P(z)dz, .. . dz,=(27)-#"A,,* exp {—}(z’—m’)A-\(z— m)}dz, ... dz,. (8) 
P(z) is completely determined by the means and covariances_ of the 
variables. In the present case the means are obviously (since 56=0) 


m,—b=ol, 
M,»=L=L)+t cos Py— fut” sin do; oe spec (9) 
My=JY=Yott sin got got? cos dp. 


The calculation of the variances and covariances is simplified by the 
fact that the scattering deviations over non-overlapping intervals are 
independent, so that formally 


d®(r)d®(r')=s8"8(t—7' )drdr’, 
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where 6(r) is Dirac’s delta-function. Hence, e.g. 


[fener eo do) =8 | (C—atde= Bs 
0 0 


/0 
and similarly for other terms involving the expectation values of products 
of such integrals.. Finally, we find 


App =? =8"t, 

Apgar’? = 48248 sin? dp, 

Aywy=Y = 3843 cos? do, (10) 
Apa='u' = het? sin do, 

Aw =e y =e? t? cos do, 

Aw=ty'= 457t3 sin do COS do, | 


where we have written ¢’=¢— $, etc. With the origin of coordinates. 
at ao, Yo, and the x-axis along the initial direction of the particle (¢).=0) 
so that in the small angle approximation i,,,=0 and x=, the distribution 
reduces to that of ¢ and y only : 


Pd, y, #) 
= Ager |- 5 [#2(¢—wt)? —3(d—ot)(y Jol?) +3(y—hoot™ “} 
math 2B 2 y—zot)"] p. 
(11) 
Fermi’s solution of the diffusion equation (Rossi and Greisen, roe cit.) 
is obtained from (11) by making w=0. 


§ 3. THE DISTRIBUTION OF OBSERVED ANGULAR DEVIATIONS. 


The fundamental quantities that we have to estimate from the data 
in the tracks are: (a) the magnetic deflection, and (b) the mean-square 
scattering deviation, i.e. in the notation of §2, the parameters w 
and s*. Two experimental methods have been used for this purpose : 
in the first, which we may call the angular difference method (Goldschmidt- 
Clermont e¢. al. 1948), the eyepiece goniometer of a microscope is rotated 
until a straight line graticule is in line with the track, and its angle with a 
fixed reference line is noted. The operation is repeated along the whole 
length of the track, advancing each time the plate (which is mounted 
on a stage movement with micrometer adjustment) by a fixed distance, 
called the cell-length. The estimations are made from the angular differences 
between successive cells. In the second method, which may be called 
the coordinate method (Fowler 1950 ), the track is aligned approximately 
parallel to one of the directions of motion of the stage, say the x-axis 
and the y- coordinate at which the track intersects a graticule in the 
eyepiece is determined by means of an eyepiece scale. These measure- 
ments are repeated at every cell-length along the whole track, as in the 
first method. Clearly, this method could be extended to measure the 
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coordinates of every blackened grain : this would yield all the information 
contained in the track. Cell-lengths would then cease to be constant, but 
would be equal to the distances between successive grains : apart from 
this last added complication, the estimation problem would remain 
unchanged. Methods based onthe same principles may, of course, be 
used in conjunction with a projection microscope. 

The first step in formulating the problem of statistical estimation is 
to derive the probability distribution of the actual observation, taking into 
account not only scattering, but also errors of various kinds. This requires 
a closer analysis of the actual methods of observation: we consider 
first the angular difference method. When aligning the eyepiece graticule 
along the track, the observer has to use his judgment to determine the 
best straight line through the grains in each cell (for a particle at minimum 
ionization, the grain density is of the order of 33 in a 100-micron cell). The 
line taken may be expected to lie somewhere between the tangent at 
midpoint and the chord in each cell (smoothing effect). There will, of 
course, be a random error due (a) to observer and instrumental error, 
and (b) to finite grain size and to the asymmetry of grain distribution 
along the trajectory : the centres of developed grains may be slightly 
displaced from the actual trajectory. In addition, there is the possibility 
of systematic errors due to distortion of the emulsion. Special 
precautions are taken to eliminate this last effect: we shall therefore 
ignore it in our analysis. Here we shall assume that at each measurement 
the observer aligns the graticule along a line which divides the angle 
between chord and tangent in a fixed ratio q, with a total random error e, 
due to various causes enumerated above. (We shall call « the spurious 
deviation and q the smoothing factor.) We also assume that « has a 
Gaussian distribution, with zero mean and the same standard deviation 
S) in every cell. These assumptions seem to be well-founded on the basis 
of the careful study of sources of error made by Goldschmidt-Clermont 
et. al (1948 and 1950). 

We proceed now to derive the required probability distribution of the 
observed angular deviations. In a track of total length ¢, let h be the 
length of each cell, n+1 the number of cells, so that (n+-1)h=t : we shall 
assume throughout in what follows that m is large, e.g. >25. We take 
the track to be initially aligned along the x-axis, and define all angles. 
relative to this axis. Let then 7%, be the observed angle so defined of the 
rth cell, d, the angle of the tangent to the true trajectory at the midpoint 
of this cell, «, the angle of the corresponding chord : 


= (Yrai—Yr)lh, 

where y,, Y;41, are the end coordinates of the cell; «, is the error of 
observation in the setting of ,. According to the foregoing assumptions 
P=, +9(b-—Hp) Hep 


a ee a (1) 
8b, =P 41 P=, + (L—G) 8a, + enya — €r- | 
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Each angular deviation 3), is thus a sum of Gaussian variables, and is 
therefore itself Gaussian. Hence the joint distribution of the 
observations (known as the likelihood function) will be the Gaussian 
multivariate distribution, as defined by equation (8) of § 2, the vector z 
standing now for the n deviations Sy, . . . , dy, ; all we have to do in order 
to find the explicit expression of this distribution in terms of the 
parameters we wish to estimate, 7. e. of w, s and 89, is to express the means, 
variances, and covariances of the deviations in these terms. This is 
easily achieved by the methods of §2, with the following results : the 
means are 


im ob == wl, «(= 152, oa a 5.1) S kes ee 


the covariances are (writing Sep’ =Sy,,—dip,) 


dy r=OoP = = 314g +a? )s*h+ 285, 
r,, 1 Ooh 41 =}(1+q—2q")s*h—s§, a3) 
Ny rh OP Op + k =0, (k>1 3 r=l, 2, ae) n). 


Note that the sum 
Alon, 84,89 de eaeae eee 


is independent of the “ smoothing factor’ g. If q=1, then 
Ap = Sha 289° Padid “Api, (= 65, «| (r= 1) 2, pat) 


In the coordinate method we may expect the random error affecting 
each coordinate measurement to arise, as in the angular difference method, 
(a) from observer and instrumental errors, and (b) from finite grain size 
and asymmetry of grain distribution about the track ; there will, however, 
be no equivalent to the “ smoothing effect’. It is reasonable, therefore, 
to regard the rth measured coordinate as the sum of the “ true” 
coordinate y, and of a total random error e«,, and to assume that the e 
all have the same Gaussian distribution with zero mean and standard 
deviation s). Since this method can be applied to the measurement of 
coordinates of every developed grain, we shall develop the theory ab 
initio for unequal cell-lengths. 

Let h, be the rth cell-length and n+1 the number of cells in a track 


n+1 
of total length t, so that 2% h,=t. It is convenient to form the estimates 


1 
in terms of second -differences of the observed coordinates, corresponding 
to the angular differences of the previous method : we write, therefore 


Bip, — Sree tras _ rete | Sea fret Set gy 


r+1 h, hyny r 
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Once again these deviations are sums of Gaussian variables, and, 
therefore, Gaussian themselves : their joint distribution is hence again of 


the form (8),cf. §2. Calculation of the means, variances and covariances 
gives the following results : 


m,=dy,= 3( (Ress +h,) , 


1 
dppb? 2(h 
rr b? = 38"( heii th, )+28% (a Zi eae ol = a) 
Nes (itd = SiBi aie bra (Fa : ae ide 3" x) ? c (7) 
his Mrst had hyyih,. : 

<a 1 

Ay neg = Op SWi, -#(;-). 

7 f-+2 if if +2 0 bee 
Ne pO Ov,, ,—=9, (k>2 6 — if 2, ar ea) 5 n). 
Note that in consequence | 

ena a Sie ra kien at Rothe r42=95"(h, +h, 41). is hi (8) 


§ 4. Maximum LIKELIHOOD ESTIMATES OF THE PARAMETERS. 


What we must do now is to establish suitable statistics giving s and w 
in terms of the observed deviations ; it will be necessary, incidentally, 
to estimate s) as well. We use for this purpose the maximum likelihood 
method (cf. Kendall 1946, Cramer 1946), i.e. we maximize the pro-’ 
abbility of the whole set of observations, obtained in the last section, 
with respect to the parameters, w, s and s) considered as unknowns. In 
this section we shall give a formal solution of the problem, which 
covers the angular difference and the coordinate methods. 

Let h stand for the cell lengths h,,..., A, ; the means vector is then 


m=wh. 
The logarithm of the likelihood function is (cf. equation (8) of § 2) 


los P— — Slog 2n—t log A,—3(z'—wh’) A“"(z—wh). . . (1) 


The covariance matrix A is a function of s and sy only (cf. equations (3) 
and (7) of § 3). It is convenient to express it in the form 


es Gy) NAN wee me fc cof og ce ae eee) 


where 7 is a linear combination of s? and s?, y some function of s? and s?, 
the elements of the matrix A are functions of y only, and to maximize 
the likelihood function 


; 1 
_L=— Flog (2a)— blog Ay— 5-(2'—ohA(Z—oh).« (3) 
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(A,, being the determinant of A) in terms of these new variables. Solving 
@L/éw=0, one obtains for the estimate & of the magnetic deviations per 
unit track 
h’A1z 
=m 4 

O= oh’ @) 
Similarly, the estimate 7 is obtained by solving oL/dn=0 : 

LE ate aes PP ce 


where €=z—wh. In order to estimate x we write 


oL | oye - 


or, substituting for 7 from (5), 


5 (AMMBAME}=0. ie ec 


Substituting for w from (4), (6) becomes a transcendental equation 
in x only. The substitution of its solution x in (4) and (5) gives the 
estimates & and 4 ; finally s? and sj are obtained from 7 and x. Formally, 
this completes the solution of the problem. In practice (6) has to be 
solved by some approximation method: e.g. by choosing xy such that 
|x|<1, one can expand A, and A™ in powers of x, and approximate 
to the left-hand side of (6) by a polynomial in x (cf. the calculations in the 
next paragraph). 

It has been shown in the case of independent observations (cf. Kendall, 
Crameér, loc. cit.) that the variances of the maximum likelihood estimates 
are asymptotically the smallest, in the sense that no other estimate can 
have a smaller variance in the limit when the number of observations 
n—>oo. Though this proposition is not proved in general for correlated 
observations, we may yet expect it to hold in the present case, because 
the correlations extend only to first, or first and second, nearest 
neighbour. It is often found that these variances have comparatively 
simple expressions, even when the maximum likelihood estimates 
themselves are complicated. In such cases, their knowledge allows one 
to assess the efficiency of any simple (or approximate) estimate, defined 
formally as the limit when »-—>co of the inverse ratio of the variance of 
the latter to that of the corresponding maximum likelihood estimate. 

The expression for the variances of the maximum likelihood estimates 
may be obtained as follows (cf. Kendall, loc. cit.): let 6,, @5,... be the 
parameters we wish to estimate. The covariance matrix V of the estimates 
0,, 65, ...1is then the inverse of a matrix V~-1, whose elements are 


ey etl! aL 
Vis=—2 sean, t=— | apap, Beez ee Po ReS () 
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(# denoting expectation value). We are interested only in the variances 
of 6 and 8*: we may start again from expression (3) for the likelihood 
function but with different parameters, substituting B for A, sh for, 
€ for x, and taking & such that the elements of B are linear in ¢. Write 


$,=w ,0,=s*h and ese The following expressions are found for the 
elements of V1: 


h’Bth n d? log B 
Wits athe | ti gapet Ve — $s 
1 dlogB, 
Ves= 22h, =a 2 Vis V0. 
We (8) 
Finally, writing log B,, =n log b, we obtain 
A ] sth 
var. (w)= vie = A Boh’ 
messi ( 9) 
; V Zsth® | (dbjdé)? 
2 —— pe a SL a = a ee 
var. (s h)= nV Vas n { naez | 


§5. SratisticaL EsTiMaTEs IN THE ANGULAR DIFFERENCE METHOD. 
We proceed now to apply the results of the last paragraph to the 
estimation problem in the angular difference method ; it is hoped to 
consider the coordinate method in a later paper. Write 
ry p= 2, A, p= —, G=1):2,2.450) 


for the elements of the covariance matrix (equations (2) and (3) of § 3), 
and A=vA (i. e., we take y=v in expression (2) of § 4), so that 


1 1) 0 OR ee 0 
—o il ==, WW oa 5 | 0 

A= (1) 
0 0 0 0 —o ih 


where «w/v. It is easily seen that the determinant A, of A satisfies 
the following difference equation (cf. Koopmans 1942) : 


ee gee as we ey ee ten oust) 
which is satisfied by a solution proportional to 6”, provided that 
emapeeaten(h Oe w \ el wn ue we oe (8) 


Finally, introducing as “initial”? conditions the values of A,;=1 and 
A,=1—2?, we find 
m+1__ Qn+1 
je St DP i he Bria iae 5 
Bi—B2 


AD 2 
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where f,, 8, are the two solutions of (3) : 
Bi=R(L+ V/(1—402)) 5; Ba= (1 —V/(1— 4”). » - + 5) 


It is easily shown that the elements A,, of A~t can be written 


A,.=08-" (A=) could eA SET ae ee on 
nm 
Write now A2=8,/, ; after a few simple transformations one finds 
ee 
= T+a : 
A, = (L—A2)-H(L-FA2) (1 A2* D), steel 9A 
eee 


A AE(1—A2")(1—A2@-*7- FED) (TBM ED) 1 2 


1, rie pes? 
A,, and A~! have thus comparatively simple expressions as functions of A 
only, and it will be convenient to maximize the likelihood function with 
respect to v and X; 72. e€. to take y=v and x=A in (2) of § 4. 

From their definition, one. can see that —}<w/v<i, and hence 
a2<i, \2<1. It is shown in the next section that optimum conditions 
are realized when s,)/s~4, and hence «~ and A?~0-03, so that near 
optimum A*<1 for k>2: in what follows we shall therefore neglect 
1/n and A¥ as compared to unity for k>2p, where p <n/2. 

With these approximations, a straightforward calculation, using (7), 
gives 


1+? 
feiss PSV JT (oe er en 
so that the estimate (4) of § 4 for the magnetic deviation becomes 
23; A,o, 1 2p 
A OSLO. rete 
oF Ly Awe =e TN oe Oba t Ona) ste.* Daag OD 


n 
where =2'dp, is simply the total observed deviation of the track ; 
ue 


the last term in the right-hand side of (9), representing what we may term 
an “‘end-effect ” of the extreme observations, will generally be <y. 
Hence the first approximation to & will be just 


Stab. cond uid ae eee TO 
Similarly, (6) of § 4 gives for (writing £,=8y,—wh) 


; 1+)2 (% 2p n—k 
ie as ae ete BME Eby ( 2 me.) 
1 k=1  r=1 ken 


Pp 2 
==( Bede a) ne Seely 
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‘ 


where “end. effects’ are again apparent in the last two terms in the 
right-hand side of (10). Let us now write 

TAA 
1? 


oe ; we n—k 
E/A-1E— [S+U(A)], where Seed ; and also S,= zi aise 


‘Equation (6) of § 4, which determines 4, may then be written 


d (S+2U(A) S, 
tl 1—)? = 
whence 
A=— = {(1—-A2)0'(A) +2418). We eee) 


where U’=dU/dd. Note the expectation value of S; is zero for k>1; 
the first approximation to the solution of (11) is obtained by neglecting 
such terms, as well as those having A* as factor, with k>2: 


Ree TS 
Substitution in (10) gives the first approximation to b: 
os (LEN Sl ae eee 
nv,=S ey (1424, 3) == Pas Oue—Wln. 2. (12) 


Using the relation «=w/v, we obtain 
n—1 
NO 2 OW, Ob a wen. ss pes NS) 
r=1 


Finally, since s%t=n(v—2w), the first approximation to the maximum 
likelihood estimate of the mean-square scattering deviation is 


n n—1 
t= F BYR+2 EF dp, — 3pm... (1A) 


(we shall use 0, to denote the observed r.m.s. scattering deviation in the 
whole track). Note that this estimate of s? is independent of the smoothing 
factor q: this is a desirable feature in practice, since the value of q will 
depend to a certain extent on the idiosyncrasies of individual observers. 
Equation (11) is in the right form for obtaining higher order 
approximations to 4 by iteration : the kth iterate would be 


1 « ‘ A 
Ay=— B{A AU Aga)+2Ap aT Ana)} = + + (15) 
We can find out whether higher order approximations are worth 


obtaining by calculating the efficiencies of the first order approximations 
above, as defined at the end of the last paragraph. Their variances 
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(standard errors squared) may be calculated by the methods of § 2 a 
§3: the algebra is somewhat tedious and need. not be reproduced ; 
neglecting some terms whose ratios to the main terms 1s of order 1/n, 


SO BO stars, a 


<(824) = (6202)? = : {(s?¢+-2nw)?-+2(s*t)?}. 


The determinants B,, in (8), (9) of § 4 and A, in (1) of this paragraph are 
related by 


log B, =n log (v/s?h)+ log A,=nlogb, . . . ~ (17) 

so that, neglecting terms of order 1/n and A” compared to unity, 
b=Ha+2s2/s*h+ (2a—1+4s5/s*h)}, . . . . (17a) 
where a=2(1—1q-+q*). We therefore obtain the variances of the 


(exact) maximum likelihood estimates by substituting from (17) in their 
expression (9) of § 4, taking €=s2/s?h : thus 


; sth nv 2Qw 
Py pes ls 2 a = 2 
€?(@t) = nBoh! = 4 =n (1 FE ) so 


(18) 
i 2(87t)? (db/dé)?) —_ 2(s?t)? ee 
Py A aati es cael aD peer oN) 14 h)*%. 
<2(824) {1 sata {1 +.2(2a—1 + 489/5%)¥} 
The required efficiencies of the approximate estimates are therefore 
Bm, (,)= im by 
: 1+2(2a—1--48?/s*h)t 
ZA Ve = 
AR hag 2+ (a-+2s?/s?h)? 


The efficiency of s?t, taking a=1, has been plotted in fig. 1 against 
the ratio p=s)/s\/h: it is ~90 per cent near optimum conditions, where 
p~3z*. We conclude therefore that one can be content as a rule with the 
first approximation to the maximum likelihood estimates of w and s?, as 
given in equations (9a) and (14) of this paragraph ; in what follows we 
shall no longer distinguish between the approximate and the exact 
estimates. . 

The fact that the estimate of the magnetic deviation is simply the 
total deviation in the track may seem surprising at first sight, but a 
moment's reflection will show that it must be so. It is clearly the best 
estimate when s)=0 and q=1, since the observed deviations are then all 
independent, and its variance is then s*t. It follows that no improved 


* Fig. 1 can still be used for aAl if we take [p?—4(1—a)]# as abscissa ; 


e.g. with a=5/8 (which is its minimum value) and p=} the efficiency is >99 per 
cent. 
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estimate could possibly reduce the error, as given in equation (16), by 
a factor greater than 1-+-2w/s*t, which is of order 1+1/2n at optimum 
cell-length. 


§6. Estimates or Momentum. Optimum CELL-LENGTH. 


It is proposed in this section to compare estimates of momentum 
obtained from magnetic and scattering deviations under optimum 
conditions, using the angular difference method. Before doing so, 
however, we may note the following application for joint measurements 
of the two deviations : the ratio 


300eH 


Oma 


Big. 1. 


r=) 
S 


a 


Percentage efficiency 


25 


The ordinates represent the percentage efficiency of the GS shai estimates 
of mean-square scattering deviations, plotted against the ratio p=s9/s./h 
of the r.m.s. spurious to scattering deviations. 


of the magnitudes of the magnetic to the r.m.s. scattering deviations per 
unit track, w and s, is independent of momentum and length of track 
(equations (1) and (3) of § 2). Since e and H are known, it will therefore 
be possible to combine measurements from a large number of tracks im 
estimating y», and hence verify the theoretical value for the scattering 
constant v, or alternatively determine it experimentally. For e=charge of 
the electron, v=8-48x108 and H=34,000 gauss, yyp=1:20 cm.?. An 
estimate y of y) from the data in a single track is clearly, from the results 
of § 5, | 

ed mE, Se” 2 (1a) 
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An approximate expression for the standard error of this estimate in 
terms of the standard errors of & and § (known from equations (16) of 
§ 5) is easily obtained by well-known methods : 


2 Bett) eet) oe La yale Qnw\?  4nw \ 
ey + Fant nG | pele. (1 a) + Tope baal) 


When n is large and the spurious scattering small, e(y)=1/./(t). An — 
improved estimate yy of yy may be obtained in the usual manner from 
the data in N tracks (with same e, H and v) by taking a weighted average 
of the estimates y;, from each track : 


Ti eve / 2 : (3) 
TN Sonar (ye) ee) 
The standard error of yy is approximately 
= 1 
Bye )asN | 2) = — 0). ee 
dys)=N | 2 a 


It is equally possible, of course, to combine the data from tracks with 
different ¢, and H, by weighting each y,, inversely as e,H,. 

We now come to the estimation of momentum. An estimate from the 
magnetic deviation, denoted by p,,, is clearly 


ot 300eH nt 300eHt 


ee a i 3 (5) 
Its relative error is approximately - 
029 ee CL 2w\t , 
hme Cate IE I+ shoe see 


Provided that ¢ is large enough and the spurious scattering small enough 
for 2w/s*t to be <1, the product of this relative error by 4/(t), 
UPm=V(te(Pm\P=Uyo. . . - . . ~ (7) 


is approximately equal to 1/79, and hence independent of momentum 
and length of track and inversely proportional to H. 
The scattering estimate of momentum may be written 


vy (t) | 
fa b) 


s 


Vv 
Li a 


(8) 
its relative error is approximately 
€(p,) els t) Mec! 2nw\? $ 
p =a Tamla) +2]. 
In order to compare these two estimates, we shall have to find out how 


changes in cell-length affect the error on p,. The standard error of $2t 
(equation (16) of § 5) may be written as follows : 


Be 282\2 
€*(87t)=2ht (ase+ 5") 420th, 0 Tt ase eee LO) 
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where a=%(1—4q-+q"). A decrease in the magnitude of h must, on the 
one hand, tend to decrease this error, because it increases the number 
n=t/h of observations ; on the other hand, we may expect it to increase 
the spurious deviation s), and hence increase the error. There must, 
therefore, be an optimum value, say ho, of h, dependent on the way s, 
varies with h, which will minimize «(§2t). A stationary value is obtained 
by solving de?/dh=0 ; this leads to 


si (sz) (d(s3)\? d(s?) .2+a 
= 7) +as? Nene r ne oe per iace Key 


hese dig te 


It is easily shown that this gives a minimum, provided that 


d?(s2) —— 2 (U9) __ 8 poms ; 
dh? ho\ dhg hy ho 
There is experimental evidence that the dependence of sy on h is in the 
form of a power law (cf. Dilworth, et. al. 1950), 


Sieaehiiice Gad Corer Gas Ie see) 


with « roughly independent of momentum, and K=~3/2. Substituting 
in (11), we find that the ratio p of spurious to scattering deviations at 
optimum cell-length, 


s2 ak (2+a)(1+2K)] 
on aaR) | tame | 3p 8 


is determined by the values of K and a (a can range from 3 to 1 ; cf. § 3). 
Table 1 shows that p is not very sensitive to variations of either a or K. 
The explicit expression for the optimum cell-length in terms of momemtum 
is 


cp, ee 
ho= (<n)rR. eb ee ce AL) 
p 


0-53 


Substituting this in (10) and (9), one finds for «/(¢) times the relative error 
on p, at optimum cell-length 


n(p,)= VOD — 1+ 4(a+2p%) (=p) FR. ere es >) 
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Both hy and 7(p,) increase with p. Since there are practical limits to the 
cell-length when one uses a microscope, there will be a range of values 
of p outside which one will have to use respectively the maximum and 
minimum practicable values of h. For such values of p, h is fixed and 
n(p,) becomes 


2\ 2714 
“(K+1)/% Peo BAe 
ntpa)=W[ 144 (L-pan-e (Z2)) |-- (16) 
With «=4-25 x 108, K=3/2*, v= 8-48 x 108 and a=1 


hy (microns)=(p MeV/c.)*/, 
n(p,) em.t=1-5 x 10-2 (p MeV/c.)?/5, (h=hpo), 


and the optimum cell-length range with 25 microns <h<400 microns, 
is 60 MeV/c.<p<2000 MeV/c. ; 

We saw before that 7(p,,) for the magnetic estimate of p is very 
approximately independent of p : hence for every value of H there will 
be a critical value of the momentum below which the scattering 
determination of p is better than the magnetic one. In the optimum 
cell-length range, and with the values of «, v, K and a above, this critical 
value is 

p MeV/c.=(1-76 x 108/H gauss)5/?. 


§ 7. DETERMINATION OF THE SIGNS OF PaRTICLES. ESTIMATION OF THE 
Ratio oF Positive to NEGATIVE PARTICLES. 


It was seen in the last section that scattering determinations of 
momentum are more accurate than magnetic ones for all momenta 
smaller than a critical value. There is, however, one important item of 
information which can be inferred only from the magnetic deviation, 
and that is information regarding the sign of the particle; such 
information is available provided we know the direction of travel of the 
particle. 

The magnetic deflection is estimated, as seen in §5, by the total deviation 
~ of the track. What we require, therefore, is the probability, given this 
deviation, and assuming the direction of travel is known, that the particle 
should be positive or negative. Let us introduce a charge variable 7, 
with +=1 for positive, = —1 for negative, particles. Let p(t) be the 
prior sign probability of the particle ; we shall also write q=p(1)/p(—1) 
for the ratio of positive to negative particles (not to be confused with the 
“smoothing ” factor of §3). We shall assume at first that q is known, 
either for primaries or for star secondaries ; we shall see later how q can 
be estimated from the statistics of particles. The probability of a 
deviation given the sign of the particle may be written, if we take w 
to mean the magnitude of the magnetic deviation per unit track, 


PW) = 7 : 


(Gast CMe Marre 2S ce lH) 


* Based on preliminary results; see Dilworth et. al. 1950, for improved. 
results and discussion. 
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Hence the joint probability of ie and 7 is 


Beg) ur ptr)s Rosle se ees. (2) 
the probability of pee of the sign of the particle is 
Ee) Pb | Lye pe Leth [1.1 (3) 
Finally, the Bpebeniiey of r oe ys is 
T) —2rey/ I: : 
ha? WY ee betel C3 
l= pay = [1+ ears i 


(We may note at this point that (4) is essentially an application of 
Bayes’ theorem, which is indubitably valid here since the prior 
probabilities p(7) can be observed.) Equation (4) expresses the maximum 
information a track can yield regarding sign: this can be shown in the 
following way. All the information obtained from the track is contained 
in the set of observed deviations 34, (r=1,2,...,). Let z stand as 
before (§ 3) for this set of observations. The considerations above may be 
repeated word for word in terms of the probabilities of z and 7 instead 
of % and 7, yielding 


P(z, 7 ene. —1)]-1 . 
ape aac Plz he Sect Sar 
Substituting expression (8) of § 2 


(7|z)= E nn, exp. (— —2roh'A“2) |", 


or, in terms of the maximum likelihood estimates of w and its variance 
(expressions (4) and (5) of § 4) 


(|z)= pee 


which reduces to (4) if one ashe expressions (9 a) and (18) of §5 
for and ¢?(0). 

Expression (4) is exact if we know s and w (i.e. if we know the 
momentum of the particle). If not, then w/s=yo, which is a constant 
depending only on the field and the composition of the emulsion (cf. § 6), 


and 


can be estimated from the track, where y is now taken as signed. Hence, 


an estimate of (4) is 
7 |b)= IAAT) tet Sts ks (8) 


This estimate becomes dubious for high energy particles, such that the 
error on the estimate $ is large. In this case we may consider the problem 
from a different angle. If the number of cells n is large, the distribution 
of the observed y given 7 is approximately Gaussian, with mean Typ. 


exp (— —2rende0)) | rere 2 5) 
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The probability P(7|y) of sign given y may then be obtained by sub- 
stituting 1/e2(y) (cf. equation (2) of § 6) for ¢ in (5). In fig. 2, P(1|y) is 
plotted against yt for yp=1-20 and q=1. 

Another problem arising in connection with sign determination is the 
estimation of the ratio q of positive to negative particles from a set 
of n tracks of a certain type : e. g. primaries in a certain range of momenta, 
or secondaries from certain types of stars. The probability of a total 
deviation %, in the 7th track, irrespective of sign, may be written 


Pb) = (27r82t,,) ter + opt N28 pel I Corrie’ + (1 —p, eer rity!) ; 


(7=_1, 2 ees) # us. Phe Ss (6) 


A plot of the probability P(1|y) of the particle being positive asa function of 
yt=yv/(t)/8s for yy=1-20. 


where p,—p(+1), #,, 8, are respectively the length and r.m.s. scattering 
deviation per unit track for the rth track. Since the deviations in the 
m tracks are independent, the maximum likelihood estimate of Dis 
obtained by maximizing 


n 
. L= & log P,(,), 
r=1 
7. €. by solving 


oL Le C2 p Ppl8y2? — Q—Op Ppl 8p? 


Op, ‘al f= perrrrit tt (1 — peer Prlsy* male 
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Hence the maximum likelihood estimate of q=p /(1—p,) in terms of 
the n observed deviations is 


n 1 al n 
a E pe | 2P(L|y,3 9) 
= St, free te (1) 
BlLpgerorvrler}a P11 y,5 4) 
1 1 
where y, is the value of y observed for the rth track. Equation (7) 
involves ¢ on both sides and must be solved by a method of successive 
approximation. The following iteration method may be used: take 
Q=1 as the zero order approximation; then the (kth order 


approximation q;, is obtained from the (k—1)th by 


23 PCL | y-3 Gea) 
i ee ee 
ig bye Tn-1) 


If the exact solution ¢>1 then it is easily seen that (a) the sequence q, 
is an increasing one, and (b) that, for every k, q¢;,<4; conversely, if ¢<1, 
then the sequence q;, is a decreasing one, and q,>4 for every k. Hence, 
in both cases q; converges to ¥ as k increases. 
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XCVI. CORRESPONDENCE 


The Velocity of Second Sound Below 1° K. 


By K. R. Arxins and D. V. OSBORNE, 
Royal Society Mond Lab., Cambridge *. 


[Received August 15, 1950.] 


PrsHKOV (1948), Maurer and Herlin (1949) and Pellam and Scott (1949) 
have shown that, as predicted by Landau (1941), the velocity of second 
sound in liquid helium II increases as the temperature is decreased below 
1°K. We have extended these measurements to lower temperatures and 
have found that the velocity rises to 152+5 metres/sec. at 0-1°K. 
Subject to the uncertainties which will be discussed below, this agrees with 
Landau’s further prediction that, at 0° K., the velocity of second sound 
should be equal to the velocity of first sound divided by 1/3. 

The low temperatures were produced by demagnetizing a mixture of 
2g. of liquid helium and 30 g. of ferric alum from a field of 10,000 gauss. 
The mixture was contained in a small metal Dewar vessel and was pumped 
by a diffusion pump of speed 7 litres/sec. through a constriction of diameter 
0-1mm. Owing to certain technical difficulties, this cryostat has not yet 
behaved ideally, and the warming times were only of the order of 60 sec. 
However, this was sufficient to enable the velocity measurements to be 
made and the preliminary results described below establish the main 
characteristics of the variation of velocity with temperature. Temperatures 
after demagnetization were deduced from the magnetic susceptibility of 
the ferric alum, measured by means of a mutual inductance and a ballistic 
galvanometer. The primary current of the mutual inductance was 
automatically reversed every three sec. and the throws of the ballistic 
galvanometer were photographically recorded. 

The second sound measurements used a pulse technique, in which a 
D.C. pulse of duration 100 w sec. was fed into a constantan wire heater. 
The resulting second sound pulse was propagated along a glass tube of 
length 3-40 cm. and internal diameter 0-5 cm. embedded in the ferric 
_ alum, and was received bya phosphor-bronze resistance thermometer. The 
time of flight was measured by displaying the received signals on a cathode- 
ray tube, which was photographed with a cine camera during the 60 sec. 
immediately following the demagnetization. Fig. 1 has been built up 
from tracings of actual photographs. In agreement with the observations 
of Pellam and Scott (1949), the received pulse is seen to be considerably 
distorted and lengthened at these low temperatures. This distortion can 
be explained in terms of dissipative losses during propagation, and a full 


* Communicated by the Authors. 
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analysis will be published shortly. No detectable change in velocity 
was caused by altering the pulse amplitude, so that the shock-wave effects 
discovered by one of us (Osborne, to be published shortly) do not seem to be 
relevant to the present results. 

Fig. 2 presents our results for the variation of velocity with tempera- 
ture, together with previous results at higher temperatures. Owing to the 
large heat influx, it is uncertain how closely the temperature of the liquid 
followed that of the ferric alum, and so the temperatures must be 
considered as only approximate, especially above 0-3° K. when the warming 
up was particularly rapid. Landau (1941) predicts that the velocity of 
second sound (w3) at 0° K. should be 1/4/3 times the velocity of ordinary 
sound (u,). The existing measurements of w, (Findlay et al., 1938, Pellam 
and Squire, 1947) do not extend below 1-5° K. and the extrapolation to 0° K. 


Fig. 1. 


INPUT PULSE AND 
0:5 msec. CALIBRATOR PIPS 


m,.sec, 


0! | 2 S 


RECEIVED SIGNAL 
AT 0:2°K. 


1 RECEIVED SIGNAL 
: AT 0:9°K. 


Tracings taken from photographs of the oscilloscope screen. 


is rather uncertain. A linear extrapolation gives, at 0° K, u,=272 
metres/sec. and w,/.\/3=157 metres/sec. Therefore, apart from the 
uncertainty of this extrapolation and the possibility that w, changes 
appreciably between 0-1° K and 0°K., the relation u,=w,/\/3 is well 
borne out by fig.2. In Landau’s theory the falling away of w, from this 
initial value is due to rotons, and fig. 2 therefore suggests that rotons are 
becoming important at about 0-1° K. and are predominant above 0-5° K. 
For this reason it seems unlikely that the specific heat of liquid helium will 
obey a T? law above 0:1° K. 

The two aspects of Landau’s theory which are mainly responsible for the 
particular result w.=w,/ 1/3 are (1) his method of calculating the density of 
the normal component, p,, and (2) his fundamental assumption that the 
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liquid can be treated as a continuum, in much the same way as a solid is 
treated in the Debye theory of specific heats, and that the thermal motion 
can therefore be built up from phonons and rotons. Near 0° K. only 
phonons are important and, since these are just ordinary sound waves, it is 
easy to see how w, becomes involved in the formula for w,. The alterna- 
tive theory developed by London (1938) and Tisza (1947), attaching less 
importance to the interatomic forces, builds up the wave function of the 


Fig. 2. 
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O + The present results. Two separate demagnetisations. 

—— Results of Peshkov and of Maurer and Herlin at higher temperatures. 

x Highest velocity reached by Pellam and Scott. Temperature 
unspecified. 


liquid as a whole from the wave functions of individual atoms, and then 
makes use of those special properties of Bose-Einstein statistics which 
lead to a condensation of atoms into the lowest state. Temperley (to be 
published shortly) has shown that if p,/p is equated to the proportion of 
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atoms not in the lowest state, then this type of theory always leads to u,=0 
at 0" K. If, however, p;,/p is evaluated by a method similar to Landau’s, 
this conclusion remains true for the Bose-Einstein gas, but probably does 
not hold when interactions between atoms are taken into account. It is 
conceivable, therefore, that a Bose—Kinstein type of theory making due 
allowance for interatomic forces would give a value of w, tending at 0° K. to 
a finite value related, presumably, to the average velocity of zero-point 
motion, which has the same order of magnitude as u,._ Nevertheless, until 
such a theory has been developed and has satisfied the exacting require: 
ment that w,=wu,/1/3 at 0° K., the present experimental results must be 
considered strongly in favour of Landau’s method of approach. 


We should like to express our indebtedness to Dr. D. Shoenberg for 
encouraging this research, to Dr. J. Ashmead for technical advice, and to 
Mr. H. N. V. Temperley for showing us his results before publication. 
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The Velocity of Flow of Helium II in the Creeping Film. 


By L. C: Jackson and D. G. HENsHAw *, 
H. H. Wills Physical Laboratory, University of Bristol. 


[Received August 30, 1950.) 


In their paper on the measurement of creep rates and the thickness of 
the helium II film Daunt and Mendelssohn (1939) combined the two 
results to calculate the average velocity of flow of the helium in the film. 
Their value (about 20 cm./sec. at 1-5° K.) can, however, only have an 
order of magnitude significance as the thickness determination was made 
on a stationary film and was an average value over a considerable range 
of heights above the liquid level and the determinations of the volume of 
helium transported per sec. were of necessity made on the moving film. 
We have, therefore, undertaken the simultaneous measurement with 
the same apparatus of the rate of creep and the thickness of the creeping 
film, using for the latter the optical method of Jackson and Burge (1949). 
a ee ad ie SC a ee 


* On leave of absence from the National Research Council of Canada. 
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In addition to providing the mean velocity of flow of the helium II at 
various temperatures, this work has brought to light an interesting 
feature of the flow itself which, in our opinion, warrants a preliminary 
report at this time. 

The beaker A (fig. 1) is of stainless steel, 4 mm. internal diameter, 
1 mm. wall thickness. A polished 2 mm. wide flat on the outer surface 
of the beaker is used for the optical measurements of the thickness of the 
helium film, while a glass capillary B, attached to the beaker by means 
of a copper-nickel tube and a kovar-glass seal is used for the liquid level 


Fig. 1. 


Apparatus for simultaneous measurement of film thickness and creep rate. 
A, stainless steel beaker; B, glass capillary tube; C, radiation shield ; 
D, centering disc; H, glass beaker on support; F, helium dewar. , 


measurements. The beaker is mounted in a brass radiation shield C 
and is supported at the bottom by means of the copper-nickel capillary 
tube. A disc D is attached to the capillary tube to maintain the beaker 
central in the dewar. The assembly is then suspended from a glass and 
tungsten support to which is attached a glass beaker E, which holds liquid 
helium during the experiment and which provides additional thermal 
protection to the stainless steel beaker. The apparatus is mounted in 
the dewar F which contains the liquid helium, the latter being protected 
by liquid hydrogen and liquid nitrogen in the outer dewars. 
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The creep rates observed are of the same order of magnitude as those 
reported by Mendelssohn and White (1950) for cleaned platinum and 
considerably in excess of those for glass beakers. Although the stainless 
steel beaker has mono-layers of barium stearate on the outside, it 
apparently behaves as a “clean beaker” in the sense of Bowers and 
Mendelssohn (1949), since the creep rate is practically independent of the 
height of the inner liquid level except for the first few millimeters, where, 
as usual, the creep rate is considerably greater (see fig. 2). The 


Fig. 2. 


LIQUID HEIGHT CMS. 


O 2 3 4 5 6 
TIME MINUTES 
Creep curve at 1:89°K. Height of liquid helium II inside beaker v. time. 
A, drops decreasing in number; B, very few drops visible ; C, no drops 
visible. 


accompanying table gives the measured values of the creep rates and the 
thickness and the deduced value of 0, the helium velocity averaged over 
the film thickness. The recorded thicknesses are those obtained when 
the point of observation of the thickness is 1 cm. above the main liquid 
helium level. The variation of the thickness of the moving film with 
height will be dealt with in a more detailed communication. 


°K Creep rate Thickness Velocity v 
wes em.3/cm./sec. cm. cm./sec. 
16-9 x 10-° 1-63 x 10-8 72 


1-1 

1-3 mato oc) 1-63 x 10-® 72 
1-5 16-8 x 10-° 1-66 x 10-® 68 
1-7 16-1 10-5 182 «10-6 60 
1:9 127°3x10-* 1-94 x 10-§ 29 
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A very striking phenomenon is observed when the full beaker ‘is 
withdrawn from the main body of the liquid helium II at the commence- 
ment of measurements. The mirror is seen to be covered with a large 
number of bright specks of light moving down and disappearing into the 
liquid helium below. Fifty or more may be visible at once immediately 
after the beaker has been raised. As the level of the liquid in the beaker 
falls the number of specks decreases steadily, the last ones disappearing 
in the present case when the liquid is some 3 mm. from the rim of the 
beaker. The bright specks, which are obviously “drops” of liquid 
helium, are somewhat elongated in the direction of motion, are 1/10 to 
1/3 mm. long and move downwards at a speed of about 1 cm. per sec. 
It is not yet possible to state anything definite about the dimensions of 
the drops perpendicular to the mirror. They are certainly much greater 
than the thickness of the normal creeping film, which is observed when 
the drops have ceased. The drop thickness is greater than can be 
measured with the existing mica compensating plate and the drops are 
too small for it to be possible to say whether they show interference 
contours. In fig. 2, A marks the position at which the observer reported 
that the number ar drops visible had decreased appreciably, B that at 
which only a few individual drops were visible on the mirror, and C that 
for which drops were no longer seen. It will be seen that the region in 
which the drops are present coincides closely with that in which an 
anomalously high creep rate is observed. There seems little doubt that 
the straight part of the creep curve represents a flow of helium 
characteristic of the continuous film and that in the initial curved part 
the excess helium is transported in the form of discrete drops. It will be 
noted that these drops move much more slowly than the film itself and 
presumably slide over the surface of the film. 
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